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The cover illustration is an artist's representation of the so-called golden 
rectangle. If the largest square in a golden rectangle is cut away, then the 
figure remaining will also be a golden rectangle. Such rectangles are charac¬ 
terized by a length to width ratio of (1 -I- V 5 )/2, the golden ratio. The ancient 
Egyptians may have used this ratio in the construction of pyramids. The ratio 
recurs often in number theory; for example. 
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where D,(n) and D->(n) are the partition functions occurring in the Rogers- 
Ramanujan identities, and F„ is the nth Fibonacci number. 
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To Joy and Amy 




PREFACE 


Most mathematics majors first encounter number theory in courses 
on abstract algebra, for which number theory provides numerous 
examples of algebraic systems, such as finite groups, rings, and fields. 
The instructor of undergraduate number theory thus faces a predica¬ 
ment. He must interest advanced mathematics students, who have 
previously studied congruences and the fundamental theorem of 
arithmetic, as well as other students, mostly from education and liberal 
arts, who usually need a careful exposition of these basic topics. 

To interest a class of students whose backgrounds are so diver¬ 
gent, this text offers a combinatorial approach to elementary number 
theory. The rationale for this point of view is perhaps best summarized 
by Herbert Ryser in Combinatorial Mathematics: “. . . combinatorics 
and number theory are sister disciplines. They share a certain inter¬ 
section of common knowledge and each genuinely enriches the 
other/"* In studying number theory from a combinatorial perspective, 
mathematics majors are spared repetition and provided with new 
insights, while other students benefit from the consequent simplicity 
of the proofs for many theorems (the proofs of Theorems 1-3, 3-4, 
3-5, 6-1, 6-2, 6-3, 7-6, 8-4, and 9-4 rely mainly on simple com¬ 
binatorial reasoning). Number theory and combinatorics are combined 
in Chapters 10 through 15 to aid in the discovery and proof of 
theorems. 

Two aspects of the text require preliminary discussion. First, 
Section 5 of Chapter 3 is critical to the whole work. This section illus¬ 
trates both the value of numerical examples in number theory and the 
role of computers in obtaining such examples. The accompanying 
exercises provide opportunities for constructing numerical tables, 
with or without a computer. Subsequent chapters may then be intro¬ 
duced to advantage by allowing students to report on conjectures 
they derive from relevant numerical tables. When students are thus 
actively involved, theorems will seem natural and well motivated. 

*Ryser, Herbert J., Combinatorial Mathematics (Carus Monograph No. 14). 
Mathematical Association of America, 1963. 
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PREFACE 


Second, in Chapters 12, 13, and 14, the student will encounter 
partitions, a topic in additive number theory. Too often, one obtains 
from number theory texts the impression that each topic has been 
thoroughly developed. The problems offered in such texts are either 
solved or unsolvable; at best, the student is invited to work a few 
peripheral problems. In this book, Chapter 12 attempts to com¬ 
municate the excitement of the mathematical chase by devising a 
procedure for forming conjectures in partition theory. The exercises 
at the end of Chapter 12 provide the student with a number of oppor¬ 
tunities for discovering theorems himself. Chapters 13 and 14 develop 
techniques in the application of generating functions to partition 
theory so that the student can prove some of the conjectures he made 
in Chapter 12. In presenting Chapter 14, the instructor should assign 
the exercises at the end prior to beginning lectures, in order to avoid 
the unmotivated presentation of complicated manipulations of series 
and products; through this procedure, the student is led to appreciate 
the relation between the exercises and the steps in the proof of the 
Rogers-Ramanujan identities and of Schur’s Theorem. 

Many people have aided me in preparing this book. I express 
particular thanks to the students in my class of Spring Term, 1970, 
at Penn State, who were taught from the completed text and who gen¬ 
erously offered valuable suggestions. Professors H. L. Alder, G. L. 
Alexanderson, and G. Piranian read the entire manuscript and made 
many useful contributions. Carlos Puig and George Fleming of 
W. B. Saunders have skillfully guided the process of publication. 

Finally I pay tribute to my wife, Joy, who has been the most 
important helper in the creation of this book; at each stage her en¬ 
couragement, intelligence, and energy have added significant value. 
She has been immensely creative both in writing expository material 
and in facilitating the communication of ideas to students. Without 
her aid, a mass of scribbled lecture notes would still be just that. 

For certain classes where the instructor deems it wise to omit 
material requiring calculus, I recommend using all or part of the 
following: Chapters 1, 2, 3 (omit Sections 3-3 and 3-4), 4, 5, 6, 7, 8 
(omit Section 8-2), 9, 10 (omit Section 10-2 except for a brief discus¬ 
sion of Corollary 10-1), 11 (omit Section 11-2 save for a summary of 
the results), 12, and 15 (up to Definition 15-1). 


George E. Andrews 
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PART I 


MULTIPLICATIVITY- 

DIVISIBILITY 


Part I is devoted to multiplicative problems; these are 
sometimes called divisibility problems, since division is the 
inverse of multiplication. 

The knowledge of divisibility that we gain in the first 
two chapters leads us to our first goal, the fundamental 
theorem of arithmetic, which discloses the important role of 
primes in multiplicative number theory. Chapter 3 introduces 
combinatorial techniques for solving important divisibility 
problems and answering other number-theoretic questions. 
In order that we can study divisibility problems in greater 
depth, Chapters 4 and 5 develop the theory of congruences. 
Chapter 6 discusses some of the important functions related to 
multiplication and division, for example the number d(n) of 
divisors of n and the sum o-(n) of the divisors of n. Our results 
on congruences are extended in Chapter 7. The final chapter 
of Part I is concerned with the distribution of primes. 


1 



CHAPTER 1 


BASIS REPRESENTATION 


Our objective in this chapter is to prove the basis representation 
theorem (Theorem 1-3). First we need to understand the principle of 
mathematical induction, a tool indispensable in number theory. 


1-1 PRINCIPLE OF MATHEMATICAL INDUCTION 

Let us try to answer the following question: What is the sum of all 
integers from one through n, for any positive integer n? If n= 1, the 
sum equals 1 because 1 is the only summand. The answer we seek is 
a formula that will enable us to determine this sum for each value of n 
without having to add the summands. 

Table 1-1 lists the sum S n of the first n consecutive positive in¬ 
tegers for values of n from 1 through 10. Notice that in each case S n 
equals one-half the product of n and the next integer; that is, 


S 


n 


n (n + 1) 
2 


0 - 1 - 1 ) 


for n = 1,2,3,^. . .,10. Although this formula gives the correct value 
of S n for the first ten values of n, we cannot be sure that it holds for n 
greater than 10. 

To construct Table 1-1, we do not need to compute S n each time 
by adding the first n positive integers. Having obtained values of S n 


Table 1-1: Sum S n of the First n Consecutive Positive Integers. 


n 


S„ 


n 


1 

2 

3 

4 

5 


1 

3 

6 

10 

15 


6 

7 

8 
9 

10 


21 

28 

36 

45 

55 


3 
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BASIS REPRESENTATION 


for n less than or equal to some integer k, we can determine S k + 1 
simply by adding (k + 1) to S k : 

S k +1 = S k + (k -f 1). 

This last approach suggests a way of verifying equation (1-1-1). 
Suppose we know that formula (1-1-1) is true for n ^ k, where k is a 
positive integer. Then we know that 

c _ k(k + l) 

Zk 2 

and so 

Sfr + i = Sk + (k+ 1) 

= fc(fc+D + (fc + 1) 

= 

(fc + 2)(fc-ny 

2 

that is, 

_ (fc+l)((fc+l) + l) 

^ + 1 2 

The last equation is the same as equation (1-1-1) except that n is 
replaced by k + 1. 

We have proved that if equation (1-1-1) holds for n ^ k, then it 
holds for n= fc+ 1, and we have already verified that equation (1-1-1) 
holds for n = 1,2, . . ., 10. Therefore, by the preceding argument, we 
conclude that equation (1-1-1) is also correct for n = ll. Since it 
holds for n = 1,2,..., 11, the same process shows that it is correct for 
n = 12. Since it is true for n — 1,2, . . ., 12, it is true for n = 13, and 
so on. We can describe the principle underlying the foregoing 
argument in various ways. The following formulation is the most 
appropriate for our purposes. 

Principle of mathematical induction: A statement about 
integers is true for all integers greater than or equal to 1 if 
(i) it is true for the integer 1 , and 

(ii) whenever it is true for all the integers 1,2 , .. .,k, then it is 
true for the integer k + 1. 

By “a statement about integers” we do not necessarily mean a formula 
A sentence such as “n(n 2 — 1) (3n + 2) is divisible by 24 is also 
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acceptable (see Exercise 17 of this section). The assumption that “the 
statement is true for n = 1,2, ... ,k” will often be referred to as the 
induction hypothesis. Sometimes the role 1 plays in the Principle will 
be replaced by some other integer, say b; in such instances the princi¬ 
ple of mathematical induction establishes the statement for all 
integers n ^ b. 

Since we have shown that equation (1-1-1) fulfills conditions (i) 
and (ii), we conclude by the principle of mathematical induction that 
(1-1-1) is true for all integers n ^ 1. We state this result as our first 
theorem. 

Theorem 1-1: The sum of the first n positive integers is 
n(n + 1) 

2 

Our next theorem also illustrates the use of the principle of 
mathematical induction. 

Theorem 1-2: If x is any real number other than 1, then 

Yx J = l + x + x 2 + ... +x n ~ 1 = - -^ 

} =o x-1 

n -1 

Remark: ^ A is shorthand for A 0 + A x + A 2 + . . . + A n _,. 

3 = 0 

PROOF: Again we proceed by mathematical induction. If 

i-i 

n — 1, then ^ x 3 = x° = 1 and (x — l)/(x — 1) = 1. Thus the theorem 

3 = 0 

is true for n = 1. 

k -1 

Assuming that ^ x j = ( x k - 1 )/(x - 1), we find that 

3 = 0 


(/c+l)-l 

2 


5 = 0 


k j y n | 

X j = ^ X j + X* — —T- 

**-o x 1 


x^ — 1 H- x k + 1 — x k 
x — 1 

x^ + 1 - 1 

X — 1 


Hence condition (ii) is fulfilled, and we have established the 
theorem. g 

Corollary 1-1: If m and n are positive integers and if 
m > 1, then n < m n . 



6 


BASIS REPRESENTATION 


Proof: Note that 

m n — 1 

n=l + l + l + ... + l^l + m + m 2 + .. . + m n ~ 1 = - r 


n terms 


m n — 1 < m n . 


EXERCISES 

1. Prove that 


l 2 + 2 2 + 3 2 + . . . + n 2 


n(n + 1) (2n + 1) 

6 


2. Prove that 

l 3 + 2 3 + 3 3 + . . . + n 3 = (1 + 2 + 3 + . . . + ft) 2 . 
[Hint: Use Theorem 1-1.] 

3. Prove that 

x n — y n = (x — y) (x n ~ 1 T x n ~ 2 y + . . . + xy n ~ 2 + y n ~ 1 ). 

4. Prove that 


1*2 + 2*3 + 3*4 + . + n(n + 1) = 


n(n +1) (n + 2) 
3 


5. Prove that 


1 + 3 + 5 + . . .4- (2n — 1) = ft 2 . 


6. Prove that 

+_i_=_«_ 

2-1 2-3 3-4 n(n +1) n+1' 

7. Suppose that F t = 1, F 2 = 1, F 3 = 2, F 4 = 3, F 5 = 5, and in 
general F tt =F n _ 1 -f-F n _ 2 for ft ^ 3. (F n is called the ftth 
Fibonacci number.) Prove that 


Fi + F 2 + F 3 + . . . + F n — F n + 2 1* 
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In Exercises 8 through 16, F n stands for the nth Fibonacci 
number. (See Exercise 7.) 

8 . Prove that 


Fi + F 3 + F 5 4- . . . + F 2n -1 — F 2n . 

9. Prove that 

F 2 4" F 4 + F 6 -h . . . + F 2 n = F 2 n + 1 1 • 


10 . Prove that 


F 2 n+1 ~F n F n+2 = (-1)» 

11 . Prove that 

FjF 2 4- F 2 F 3 4- F 3 F 4 + . . . 4- F 2n - 1 F 2n = F 2n . 

12 . Prove that 

FjF 2 4- F 2 F 3 4- F 3 F 4 4- ... 4- F 2n F 2n + 1 = F 2w + 1 — 1. 

13. The Lucas numbers L n are defined by the equations 
Lx = 1, and L n = F n + 1 4- F n _x for each n ^ 2. Prove that 

F w L n — 1 4” L n _ 2 (n — 3). 

14. What is wrong with the following argument? 

“Assuming L n = F n for n = 1,2, . . ., k, we see that 

L k + 1 = L k 4- Lfc-i (by Exercise 13) 

= F /C + F /C _ 1 (by our assumption) 

= F fc + 1 (by definition of F k+1 ). 

Hence, by the principle of mathematical induction, 
F n = L n for each positive integer n.” 

15. Prove that F 2n = F n L n . 

16. Prove that 


Lj 4- 2L 2 4- 4L 3 4- 8 L 4 4- . . . 4 - 2»" 1 L B = 2 n F w + 1 - 1. 
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17. Prove that n(n 2 — l)(3n +2) is divisible by 24 for each 
positive integer n. 

18. Prove that if n is an odd positive integer, then x + y is a 
factor of x n + y n . (For example, if n = 3, then x 3 + y 3 = 
(x + (/)(x 2 -xt/-bt/ 2 ).) 


1-2 THE BASIS REPRESENTATION THEOREM 

Early in grade school, you learned to express the integers in 
terms of the decimal system of notation. The number ten is said to be 
the base for decimal notation, because the digits in any integer are 
coefficients of the progressive powers of 10. 

Example 1-1: In the decimal system, two hundred nine is 
written 209, which stands for 

2 • 10 2 + 0 • 10 1 + 9 • 10°. 

Similarly, for four thousand one hundred twenty-nine we write 4129, 
which stands for 


4 • 10 3 H- 1 * 10 2 + 2 • 10 1 + 9 * 10°. 

We can likewise express integers in binary, or base two, notation. 
In this case the digits 0 and 1 are used as the coefficients of the 
progressive powers of 2. 

Example 1-2: In binary notation, we write twenty-three as 
10111, which stands for 

1 * 2 4 + 0 * 2 3 + 1 - 2 2 + 1 - 2 1 + 1 - 2 °, 
and thirty-six has the form 100100, which stands for 

1 -2 5 + 0-2 4 -h0-2 3 + 1 -2 2 + 0-2 1 + 0-2°. 

The basis representation theorem states that each integer greater 
than 1 can serve as a base for representing the positive integers. 

Theorem 1-3 (Basis Representation Theorem): Let k be any 
integer larger than 1 . Then , for each positive integer n, there exists a 
representation 


n = a 0 k s + a x k s 1 + . . . + a 


(1-2-1) 
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where a 0 # 0, and where each a { is nonnegative and less than k. 
Furthermore, this representation of n is unique; it is called the 
representation of n to the base k. 

Remark: For each base k, we can also represent 0 by letting 
all the a t be equal to 0 . 

PROOF: Let b k (n) denote the number of representations of n to 
the base k. We must show that b k (n) always equals 1. 

It is possible that some of the coefficients a, in a particular 
representation of n are equal to zero. Without affecting the representa¬ 
tion, we may exclude terms that are zero. Thus suppose that 

n — a 0 k s + a x k s ~ x + . . . + a s - t k l , 

where now neither a 0 nor a s - t equals zero. Then 

n — 1= a 0 k s + a 1 k s ~ 1 + . . . + a s - t k l — 1 

= a^k 8 + a x k s ~ x + . . . + (a s - t — 1) k l + k f — 1 

= a 0 k s + a t k s -' + . . . + (a s - t - l)k* + ‘j? (k-l)k j , 

3 = 0 

by Theorem 1-2 with x—k. Thus we see that for each representation 
of n to the base k, we can find a representation of n — 1 . If n has 
another representation to the base k, the same procedure will yield a 
new representation of n — 1 . Consequently 

b k (n) < b k (n - 1). (1-2-2) 

It is important to note that inequality (1-2-2) holds even if n has no 
representation because b k {n) = 0 < b k (n- 1 ) in that case. Inequality 
(1-2-2) implies the following inequalities: 

b k (n + 2) < b k (n + 1) =£ b k (n), 

b k (n + 3) < b k (n +2) < b k (n + 1) < b k (n ), 

and, in general, if m ^ n + 4 , 


b k {m) ^ b k (m~ 1) < b k (m- 2) < . . . ^ ^(n + 1) < b k (n). 

Since k n > n by Corollary 1-1, and since k n clearly has at least one 
representation (namely, itself), we see that 


1 ^ b k (k n ) < b k (n) < b k ( 1) = 1. 
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The extreme entries in this set of inequalities are ones, so that all of 
the intermediate entries must be equal to 1. Thus b k (n) = 1, and 
Theorem 1-3 is established. H 

Once a base k(k > 1) has been chosen, we can represent any 
positive integer n uniquely as a sum of multiples of powers of k : 

n = a s k s + a s - x k s ~ l + . . . + a x k + a 0 

where a 0 , a l9 . . ., a s stand for nonnegative integers less than k. This 
representation is usually denoted by the symbol “a s a s - 1 . . . 

(not a product). For bases less than or equal to ten, the a t are 
chosen from the symbols 0 , 1 , 2 , . . ., 9 with their usual meanings; 
however, if k is greater than ten, one must invent additional symbols 
in order to have a total of k different symbols (one for each of the 
integers from zero to k— 1 ). 

Example 1-3; Let A stand for ten; B, for eleven; C, for twelve; 
D, for thirteen; F, for fourteen; and F, for fifteen. Using these symbols, 
we write three hundred to the base sixteen as 12 C, that is, 

1 * 16 2 + 2 • 16^ 12 • 16°. 

Similarly, two hundred is represented as C 8 ; one hundred, as 64; 
and ten, as A. 

This ability to represent integers to any base greater than one is 
much more than a mathematical curiosity. The bases 2, 8 , and 16 are 
important in computer science. More useful to us, however, is the 
applicability of Theorem 1-3 in the proofs of many results about in¬ 
tegers. We shall obtain some of these results in the next chapter. 


EXERCISES 

1. Write the numbers twenty-five, thirty-two, and fifty-six to 
the base five. 

2. Write the numbers forty-seven, sixty-eight, and one 
hundred twenty-seven to the base 2. 

3. What is the least number of weights required to weigh any 
integral number of pounds up to 63 pounds if one is allowed 
to put weights in only one pan of a balance? 

4. Prove that each integer may be uniquely represented in 
the form 
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n = ^ c s 

j = 0 

where c s ^ 0, and each Cj is equal to —1, 0, or 1. 

5. Using Exercise 4, determine the least number of weights 
required to weigh any integral number of pounds up to 80 
pounds if one is allowed to put weights in both pans of a 
balance. 

6. Prove that if 


a s k s + + . . . + a 0 

is a representation of n to the base k , then 0 < n ^ k s + 1 — 1. 
[Hint: Use Theorem 1-2.] 

7. Without using Theorem 1-3, prove directly that two dif¬ 
ferent representations to the base k represent different 
integers. [Hint: Use Exercise 6.] 



CHAPTER 2 


THE FUNDAMENTAL THEOREM 
OF ARITHMETIC 


In every branch of mathematics we meet theorems that seem so 
natural that, if we held no respect for logical rigor, we would be 
tempted to takethem for granted. We must prove such theorems care¬ 
fully, not only because they may be crucial in the logical structure of 
the theory, but also because every few years some proposition whose 
denial has long appeared to be utterly unacceptable to common sense 
turns out to be false. 

You are now acquainted with one of these important theorems, 
the basis representation theorem (Theorem 1-3). This chapter will 
culminate in another basic proposition, the fundamental theorem of 
arithmetic (Theorem 2-5), from which we shall obtain significant in¬ 
formation about the multiplicative structure of the integers. In passing, 
we note that a certain apparently obvious extension of the theorem 
to other number-theoretic structures resembling the integers is false 
(see Exercise 1 in Section 2-5). 

We begin by developing Euclid’s division lemma (Theorem 
2-1), by means of which we shall study the divisibility properties of 
integers (Theorems 2-2 and 2-3). Knowledge of these properties will 
enable us to prove the fundamental theorem of arithmetic. 

2-1 EUCLID'S DIVISION LEMMA 

The division lemma furnishes the foundation for much of number 
theory; yet it is simply a rigorous restatement of the well-known fact 
that division of one integer by another yields an integral quotient 
and an integral nonnegative remainder smaller than the divisor. In 
order to avoid unnecessary complications, we limit ourselves to posi¬ 
tive divisors. The proof we shall give for the lemma relies heavily on 
the basis representation theorem. 


12 
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Theorem 2-1 (Euclid's Division Lemma): For any integers 
k (k > 0 ) and j , there exist unique integers q and r such that 
0 < r < fc and 


j=qk + r. ( 2 - 1 - 1 ) 

PROOF: Note that we have simply rewritten a division problem 
in terms of multiplication and addition. In the notation used above, j 
is the dividend; fc, the divisor; q, the quotient; and r, the remainder. 

If k = 1, r must be zero, so that q =j. 

If k > 1, suppose first that j > 0. (We shall consider the cases in 
which j = 0 and j < 0 later.) By the basis representation theorem 
(Theorem 1-3), j has a unique representation to the base k , say 

j = a s k s + ag-ik 8-1 + . . . + a t k + a 0 
— k(a s k S ~ 1 + a s ^ l k s ~ 2 + . . . + a x ) + a 0 
= kq + r, 

where 0 ^ r = a 0 < k. 

If a second pair q' and r' existed, we could find a representation 
for q' to the base k , say 


q f — b t k f -f- . . . + b x k + 

so that 

j = kq ' + r' 

= b t k t+1 + fcjfc 2 + b 0 k + r', 

but / 

J — + a s ^ 1 k s ~ 1 + . . . + aj/c + a 0 . 

By the uniqueness of the representation of j to the base k, we see that 
£ = s — 1 , b t — a i+u r' = a 0 = r, and thus 

q' = b t k ( + . . . + b x k + b 0 
= a^- 1 + . . . + a 2 k + 

= Q- 

Consequently, the theorem is true for positive values of j. 

If j = 0, it is easy to verify that q = r = 0 is the only possible solu¬ 
tion of ( 2 - 1 - 1 ) with 0 ^ r < k. 
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If j < 0, then —j > 0, and there exist unique integers q" and r" 
such that 


-j = kq" + r". 

Ifr" = 0, then j=k(—q"); thus we may take q = —q" and r = 0. 
If r" # 0, then 


j = —kq" ~ r" 

= k(-q"~ 1) + ( k-r"), 

and we may take q — —q" — 1, and r=k — r". 

In either case, q and r satisfy equation (2—1—1). Uniqueness for 
negative j follows from uniqueness for —j, which is then positive. ■ 


EXERCISES 

1. Without assuming Theorem 2-1, prove that for each pair of 
integers j and k(k >0), there exists some integer q for 
which j — qk is positive. 

2. The principle of mathematical induction is equivalent to 
the following statement, called the least-integer principle: 
Every non-empty set of positive integers has a least 
element. 

Using the least integer principle, define r to be the least 
integer for which j — qk is positive (see Exercise 1). Prove 
that 0 < r — k. 

3. Use Exercise 2 to give a new proof of Theorem 2-1. 

4. Any set of integers / that fulfills the following two condi¬ 
tions is called an integral ideal: 

(i) if n and m are in J, then n + m and n — m are in/; and 
(ii) if n is in J and r is an integer, then rn is in/. 

Let be the set of all integers that are integral multiples 
of a particular integer m. Prove that f m is an integral ideal. 

5. Prove that every integral ideal / is identical with for 
some m. [Hint: Prove that if/ ^ {0} = J n , then there exist 
positive integers in /. By the least-integer principle 
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(Exercise 2), there is a least positive integer in J , say m. 
Then prove that J = J> m .\ 

6. Prove that if a and b are odd integers, then a 2 — b 2 is 
divisible by 8. 

7. Prove that if a is an odd integer, then {a 2 + (a + 2) 2 + 
(a-f 4) 2 + l} is divisible by 12. 


2-2 DIVISIBILITY 

If a and b (b ^ 0) are integers, we say b divides a , orb is a divisor 
o f a, if a/b is an integer. We shall write b \ a to indicate that b divides 
a; and, bfa, to indicate that b does not divide a. 

Example 2-1: 2 | 4, but 3T4. 

Example 2-2: If a is an integer, then 1 | a and —1 | a; further¬ 
more, if a 0, then a | a and —a | a . 

Example 2-3: For each nonzero integer a, a | 0. 

Example 2-4: Let a, b , c, and d be integers. Suppose that an 
integer e divides both a and c. Then there exist integers x and y such 
that a — ex and c = ey. Therefore, 

ab + cd= exb + eyd 
= e(xb + yd). 


which implies that e \ (ab + cd). Consequently, if e \ a and e | c , then 
e | (ah + cd) also. 

If a and b are integers, then any integer that divides both a and b 
is called a common divisor of a and b. 

Definition 2-1: If a and b are integers , not both zero , then an 
integer d is called the greatest common divisor of a and b if 

(i) d > 0, 

(ii) d is a common divisor of a and b, and 

(iii) each integer f that is a common divisor of both a and b is 
also a divisor of d. 

We shall prove shortly that each pair of integers a and b , not both 
zero, has a unique greatest common divisor; this integer is denoted 
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by g.c.d .(a,b). Many authors write (a,b) for g.c.d .(a,b). We do not, 
because we shall often use (a,b) to represent a point in the Euclidean 
plane. 

Example 2-5: The positive divisors of 12 are 1, 2, 3,4,6, and 12. 
The positive divisors of—8 are 1, 2, 4, and 8. Thus the positive com¬ 
mon divisors of 12 and —8 are 1, 2, and 4; hence, g.c.d.(12,-8) =4. 

Example 2-6; If a 0 and a \ b , then g.c.d. (a, b) — \ a |. 

Our proof of the existence and uniqueness of the greatest common 
divisor depends completely on the Euclidean algorithm, a device 
involving nothing more than repeated application of the division 
lemma. Before proceeding with the proof, we illustrate the Euclidean 
algorithm with the following example. 

Example 2-7: What is g.c.d.(341,527)? Dividing 341 into 527, 
we find that the q and r, as in Theorem 2-1, are 1 and 186, respec¬ 
tively, because 


527 = 341 *1 + 186 (2-2-2) 

Clearly, any number that divides both 527 and 341 also divides 186; 
for, if dc = 527 and de = 341, then d{c-e) = 186. 

In the same manner, 

341= 186*1 + 155, (2-2-3) 

186 = 155 *1+31, and (2-2-4) 

155 = 31 * 5. (2-2-5) 

By equation (2-2-5), 31 divides 155. Therefore 31 divides 186, by 
(2-2-4); 31 | 341, by (2-2-3); and 31 | 527, by (2-2-2). Thus 31 
satisfies (i) and (ii) in Definition 2-1. Finally, if/| 341 and/| 527, 
then /| 186, by (2-2-2); /1 155, by (2-2-3); and f\ 31, by (2-2-4). 
Since all three conditions in Definition 2-1 are satisfied, we see that 
31 = g.c.d.(341,527). 

The proof of Theorem 2 involves nothing more than the proce¬ 
dure of Example 2-7 in a general setting. 

Theorem 2-2: If a and b are integers , not both zero , then 
g.c.d.(a, b) exists and is unique. 
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Proof: Clearly g.c.d .(a,b) is not affected by the signs of a and 
b. We have asserted that not both a and b are zero; however, if either 
is zero, say b = 0, then g.c.d. (a, 0) is clearly equal to \a\. In the 
following proof, we may therefore assume that a ^ b > 0. 

By Theorem 2-1, there exist q t and r x (0 ^ r x < b) such that 


a = bq x + fj. 


If r x > 0, there exist q 2 and r 2 such that 

b = r x q 2 + r 2 , 

where 0 ^ r 2 < r t . If r 2 > 0, there exist q 3 and r 3 such that 


ri = r 2 q 3 + r 3 . 


where 0 ^ r 3 < r 2 . This process may be continued as long as the 
newly arising r x does not equal zero. 

Since 


b > u > r 2 > r 3 > ... > 0, 

we see, by mathematical induction, that 0 < < fo — i. Therefore, in 

at most b steps, we shall obtain an r n that is zero. 

Thus the last application of Theorem 2-1 in our procedure leads 
to the result 


r n -2 = r tt _iq n + 0; 

that is, r n = 0. The computation of g.c.d.(341,527) in Example 2-7 
suggests that r n ^ t is equal to g.c.d. (a, b). 

We have constructed the r { so that r n _ x > 0. By working backward 
from the final equation, we may establish successively that r n - t 
divides r w _ 2 , r n _ 3 , . . ., r 2 , r u b 9 and a. Finally, if/divides both a 
and b , we may proceed successively from the initial equation to deduce 
that f divides r l9 r 2 ,. . ., r n _ 2 , and r n _ x . (Mathematical induction is 
tacitly used in both of these procedures.) Thus r n _j satisfies the re¬ 
quirements of Definition 2-1; therefore, /•„_! = g.c.d.(#,&). 

Each pair of integers has only one greatest common divisor; for, 
if both di and d 2 are greatest common divisors of some pair a and b , 
it follows from (iii) of Definition 2-1 that gdi~d 2 and hd 2 = d 1 , 
where h and g are positive integers; hence, d 2 = ghd 2 ; thus 1 = gh , 
and so g = h = 1. We conclude that d 1 — d 2 . ■ 
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An integral linear combination of the integers a and b is an ex¬ 
pression of the form ax + by, where x and y are integers. We shall 
prove two corollaries of Theorem 2-2 that characterize those integers 
expressible as integral linear combinations of a particular pair of 
integers. First we consider an example. 

Example 2-8: Using the results in Example 2-7, we shall 
express 31 = g.c.d.(341,527) as an integral linear combination of 341 
and 527. We start with the next-to-the-last equation and successively 
substitute the other equations into it until we reach the initial equa¬ 
tion. Equation (2-2-3) may be rewritten as 

31 = 186 - 155 • 1. 

Using equation (2-2-3) to express 155, we find that 
31 = 186 - (341 - 186 • 1), 


that is, 

31 = 2 • 186-341. 

Using equation (2-2-2) to express 186, we see that 
31=2 • (527-341 • 1) -341, 

that is, 

31 = 2-527-3 -341. 

Thus we have expressed 31 as a linear combination of 341 and 527. 
Note that, in addition, 

31 = 14 • 341 - 9 • 527, 

and 

31 = -20 -341 + 13 • 527. 

In general, there may be many pairs x and y such that 

g.c.d .(a,b) = ax + by. 

COROLLARY 2-1: If d= g.c.d.(a,fc), then there exist integers 
x and y such that 


ax + by = d. 


(2-2-6) 
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PROOF: By taking the n equations used in the proof of Theorem 
2-2 and using the principle of mathematical induction, we shall first 
establish that there exist integers x f and y { such that 

axi + by { = (2-2-7) 


for i = 1,2,. . ., n — 1. 

When i = 1, let x x = 1, and y 1 = —q 1 . Now assume that integer 
solutions of (2-2-7) have been found for all i less than or equal to 
k ( k < n — 1). We know that 


r k -i = r k q k + 1 4- r k + 1 ; (2-2-8) 

thus, by the induction hypothesis, 

{ax k . 1 + by k - x ) - ( ax k 4 by k )q k + 1 = r k+1 . (2-2-9) 

We can rewrite equation (2-2-9) in the form 


(**-1 - *kQ k + 1 )a+ (y k -1 - y k q k+ i)b = r k+1 . (2-2-10) 

Hence, x k + 1 = x k - t — x k q k+1 and y k + l = y k - t — y k q k+1 are solutions 
of equation (2-2-7) when i = k + 1. 

Thus formula (2-2-7) is established for i = 1,2,. . ., n — 1, by 
the principle of mathematical induction. In particular, if i = n — 1 
in equation (2-2-7), we get the relation 

ax n - x 4 by n - x = = g.c.d.(a,h). ■ 

Corollary 2-2: In order that there exist integers x and y 
satisfying the equation 

ax+by = c , (2-2-11) 

it is necessary and sufficient that d | c, where d= g.c.d.(a,b). 

Proof: Let a = ed, b — fd. Then, if (2-2-11) holds, we get the 
relation 


c — edx 4- fdy = d(ex 4- fy). 


Thus d | c . 


On the other hand, if d | c, let kd= c. Then, by Corollary 2-1, 
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there exist x' and y' such that 

ax' + by' = d. 


Hence 

a(x'k) + b(y'k) = dk = c. 


Thus x = x'k and t/ = y'k provide a solution of (2-2-11). ■ 

Our next theorem follows from Corollary 2-2; it will be the prin¬ 
cipal tool we shall use in our proof of the fundamental theorem of 
arithmetic. First we need some further definitions. 

Definition 2-2: A positive integer p other than 1 is said to be 
a prime if its only positive divisors are 1 and p. 

The first few primes are 2, 3, 5, 7, 11, .... (Although the 1968 
World Almanac lists 1 as a prime, it is convenient not to do so in 
number theory. As you will see, the statement of the fundamental 
theorem of arithmetic would be needlessly complicated if 1 were 
considered prime. Perhaps this fact has been impressed on the editors 
of the Almanac, for the 1969 and later editions do not list 1 as a prime.) 
The primes have many interesting properties, some of which we shall 
explore in later sections. 

Definition 2-3: We say that a and b are relatively prime if 
g.c.d .(a,b) = 1. 

Example 2-9; The positive divisors of 7 are 1 and 7. The positive 
divisors of 27 are 1, 3, 9, and 27. Since 1 is the only positive common 
divisor of 7 and 27, these two integers are relatively prime. 

Example 2-10; If d = g.c.d. (a,b) , then a\d and b\d are relatively 
prime. To show this, let x and y be integers such that ax-\-by = d. 
Then (a/d)x + (b/d)y = 1, and so g.c.d.(a/d, b/d) = 1. 

Example 2-11: If p is a prime and a is an integer such that 
pfa, then p and a are relatively prime. In particular, any two dif¬ 
ferent primes are relatively prime. 

Theorem 2-3: If a, b , and c are integers , where a and c are 
relatively prime , and ifc | ab , then c divides b. 

PROOF: Since g.c.d. (a, c) = 1, Corollary 2-2 implies that there 
exist integers x and y such that 
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cx + ay = 1. 

Therefore, 

cbx + aby = b. (2-2-12) 

Since c \ ab , there exists a k such that ah = kc. 

Substituting kc for ah in equation (2-2-12), we find that 

cbx + key = h. 

Thus 

c(bx 4- ky) = h. 

Hence c | h. 

Corollary 2-3: 7/a and h are integers, pis a prime, p | ah, and 
p“|“a, then p \ b. 

PROOF: If pfa, then g.c.d.(a,p) = 1, because the only positive 
divisors of p are 1 and p . Hence, by Theorem 2-3 (with c = p), we 
see that p | h. g 

Corollary 2-4: If p | a x a 2 ... a n , then there exists some i 
such that p | a f . 

PROOF: We proceed by mathematical induction. The assertion 
is clear for n = 1. For n = 2, it is a restatement of Corollary 2-3. We 
assume that the assertion is true for n less than or equal to k. Then for 
n — k + 1 we consider the relation 

p | (a 1 a 2 . . . a k )a k + x . 

By Corollary 2-3, either p | a k + l (so that i = k + 1) or p \ a x a 2 . . . a fc , 
in which case p | a* for some i (1 ^ t — A:), by the induction hypo¬ 
thesis. ■ 


(2-2-13) 

(2-2-14) 


EXERCISES 

1. Using the technique described in Example 2-7, find the 
greatest common divisor of the following pairs of integers. 

(a) 527, 765 (d) 108, 243 

(b) 361, 1178 (e) 132, 473 

(c) 12321, 8658 (f) 156, 1740. 
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2. Using the technique described in Example 2-8, find the 
greatest common divisor d of 299 and 481. Then find 
integers x and y such that 

299% + 481 y = d. 

3. In Exercise 2, replace 299 and 481 by 129 and 301 and 
proceed as indicated. 

4. The least common multiple of two positive integers a 
and b (denoted by l.c.m .(a,b)) is defined to be the smallest 
positive integer that is divisible by both a and b. Prove 
that 

l.c.m .(a,b) = - ~ty —rr- 

v g.c.d. (a,Z?) 

5. Compute the following: 

(a) l.c.m.(25,30) (d) l.c.m.(28,29) 

(b) l.c.m.(42,49) (e) l.c.m.(n,rc +1) 

(c) l.c.m.(27,81) (f) l.c.m.(2n-l,2n + 1). 

6. Prove that l.c.m.(ah,ad) ^ a[l.c.m.(b,d)]. 

7. Prove that if D = d/g.c.d.(M) and B = fo/g.c.d.(M), then 

a c__ aD + cB 
+ d l.c.m.(b,d) * 

Discuss the relationship between this equation and the 
addition of fractions by means of a “common denominator”. 

8. Prove that g.c.d.(a + b,a — b) ^ g.c.d,(a,fo). 

9. Prove that, if a and b are nonzero integers, then 
g.c.d.(a,fo) | l.c.m.(a,b), but that (g.c.d.(a,b)) 2 1‘l.c.m.(a,b) 
unless g.c.d.(a,fo) = 1. 

10. Let be the set of all integral multiples of the integer m. 
Prove that 


m D ^ n J^l.c.m.(m,n) * 

[If S and T are sets, then SOT denotes the set of elements 
common to both S and T ]. 





2-3 THE LINEAR DIOPHANTINE EQUATION 


23 


11. Prove that < / e . c .a.(m,n) contains all the elements of and 
all the elements of J n . Prove that if ^edntains all the 
elements of J m and u , then contains all the elements 

of ?.c.d.(m,n) • 

12. We can define a generalized Fibonacci sequence 

& 3 , -^ 4 , ... by first selecting four integers a , b, c, and e, 
and then letting & t = a, ^ = b, and !F n = c^ n -, + e#»- 2 
if n > 2. 

(a) Prove that, if d — g.c.d .(a,b), then d | for all n^l. 

(b) Prove that, if/= g.c.d.(^ m , & m - x ) and ffe, then/1 d. 

2-3 THE LINEAR DIOPHANTINE EQUATION 

We have now amassed enough results to prove the fundamental 
theorem of arithmetic. Before beginning this task, however, we shall 
consider a result related to Corollary 2-2. 

Let a, b, and c be integers (a # 0 ^ b). The expression 

ax + by = c (2-3-1) 

is called a linear Diophantine equation. A solution of this equation 
is a pair ( x,y ) of integers that satisfies the equation. 

From analytic geometry we know that each point in a plane can 
be associated with an ordered pair of real numbers called coordinates. 
A point whose coordinates are integers is called a lattice point. In 
the plane, the locus of points whose coordinates x and y satisfy 
equation (2-3-1) is a straight line. Thus the solutions of this linear 
Diophantine equation correspond to the lattice points lying on the 
straight line. Depending on the values of a, b, and c, there may be 
none or many lattice points on the graph of ax+ by = c. 

From Corollary 2-2 we know that the linear Diophantine equa¬ 
tion ax -I - by ■ c has a solution if and only if d | c, where d — 
g.c.d .(a,b). Suppose that d does divide c. Using the procedure 
illustrated in Example 2-8, we can find w 0 and z 0 such that 

aw 0 + bzo — d. 

Next, we find an integer k such that c = dk; and we let x 0 = w 0 k and 
y 0 = z 0 k. Clearly, (x 0 ,i/ 0 ) is a solution of equation (2-3-1). Suppose 
( x',y ') is also a solution of equation (2-3-1). Then 


ax' + by' — c = ax 0 + by 0 . 
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and so 


Therefore, 


a b , a . b 
d X + d y = d Xo + d y »- 


2 (x'-x 0 )=|(y, -«/'). 


(2-3-2) 


By Example 2-10, g.c.d .(a/d,b/d) = 1; thus, by Theorem 2-3, 


b 

d 


(x' -x 0 ). 


Hence, there exists an integer t such that x'—x 0 =tb/d; that is, 
x' = x 0 +tb/d. Substituting tbld for x'— x 0 in equation (2-3-2), 
we find that 


a b b , 
d f d~ d iy ° 


V'), 


and so 


that is, 


yo -y' 



, . a 

y =y °~ t d- 

We conclude that, for each solution (x',y') of equation (2-3-1), 
there exists an integer t such that 

x' = x 0 +f^ and y' = y 0 -t^- 


In fact, (x 0 + tbld, y 0 — ta/d) is a solution of equation (2-3-1) for each 
t, because 

a (x 0 + t^j + b[y 0 — t^ = ax 0 + by 0 + t^j- — = c - 

We now summarize the preceding results. 


THEOREM 2-4: The linear Diophantine equation 

ax + by = c 

has a solution if and only if d\c, where d= g.c.d.(a,b). Furthermore, 
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if (x 0 ,t/o) is a solution of this equation , then the set of solutions of 
the equation consists of all integer pairs (x,t/), where 

x= x 0 + t~ and y = yo — t — (t = ..., -2,-1,0,1,2,.. .). 

d d 

Example 2-12: The linear Diophantine equation 15x + 27y = 1 
has no solutions, since g.c.d.(15,27) = 3 and 3~f" 1. 

Example 2-13: The linear Diophantine equation 5x + 6y — 1 has 
a solution, since g.c.d.(5,6)= 1. By inspection, we see that (—1,1) is 
such a solution. Hence, all solutions are given by (x,t/), where 
x = -1 + 6t, y = 1 - 5t (t = . . . -2,-1,0,1,2, . . .). 


EXERCISES 

1. Find the general solution (if solutions exist) of each of the 
following linear Diophantine equations: 

(a) 2x + 3y = 4 (d) 23* + 29 y = 25 

(b) 17x +19t/ = 23 (e) lOx — 8y = 42 

(c) 15x + 51y = 41 (f) 121x — 88y = 572. 

2. A man pays $1.43 for some apples and pears. If pears cost 
17e each, and apples, 15$ each, how many of each did he 
buy? 

3. Draw the graphs of the straight lines defined by the equa¬ 
tions in parts (a), (b), and (c) of Exercise 1. 

4. Prove that the area of the triangle whose vertices are 
(0,0), ( b 9 a ), and (x,y) is | by — ax |/2. 

5. Prove that if (x 0 ,{/ 0 ) is a solution of the linear Diophantine 
equation ax — by = 1, then the area of the triangle whose 
vertices are (0,0), (b,a), and (x 0 ,y 0 ) is 1/2. 

6. Is there a nondegenerate triangle with area smaller than 
1/2 and with vertices (p t ,q x ), {p^q^), and (p 3 ,q 3 ), where 
the Pi and q { are integers? Prove your answer. 

7. What is the perpendicular distance to the origin (0,0) from 
the line defined by the equation 


ax — by — 1 ? 
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8. What is the shortest possible distance between two lattice 
points on the line defined by the linear Diophantine 
equation 


ax — by — c? 

(Recall that, by the definition of a linear Diophantine equa¬ 
tion, the constants a, b, and c must be integers.) 


2-4 THE FUNDAMENTAL THEOREM 
OF ARITHMETIC 

Table 2-1 exhibits the ways the first twelve positive integers may 
be factored into primes. 

The evidence of Table 2-1 suggests that there is exactly one 
prime factorization of each integer greater than 1, if the order of the 
prime factors is disregarded. 

While not as intuitively apparent as the basis representation 
theorem (Theorem 1-3), the foregoing conjecture not only is true, but 
is so important to the study of integers that it is called the funda¬ 
mental theorem of arithmetic. 

Theorem 2-5 (Fundamental Theorem of Arithmetic): For each 
integer n > 1, there exist primes p x ^ p 2 — Pz — • • • — Pr such that 


n = p x p 2 . . . p r \ 
this factorization is unique. 

Proof: Our first goal is to prove that each integer has at least 
one prime factorization. Note that (see Table 2-1) such a factorization 

Table 2-1: Factorization of Integers Into Primes. 

n factorizations 


1 

2 

3 

4 

5 

6 

7 

8 
9 

10 

11 

12 


2 

3 

2 • 2 
5 

2 *3=3 -2 
7 

2 * 2*2 

3 • 3 

2 *5 = 5 *2 
11 

2 *2 *3 = 2 *3 *2 = 3 *2*2 
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exists for all n(2 ^ n ^ 12). Let us now assume that each integer 
m(l < m ^ k) can be factored into primes. 

Now, either fc + 1 is prime or it is not. If it is prime, then its prime 
factorization consists just of the prime itself. If /c-b 1 is not prime, then 

k + 1 = ab , 

where 1 < a < k -f 1 and 1 < b < k + 1. Since 1 < a ^ k and 1 < fo < 
both a and fo have prime factorizations, say 

a = p x p 2 . . . p s and b = p/p 2 ' . . . p/. 

Therefore, 

/c + l = p 1 p 2 . . . p s p/p 2 ' . . . pi. 

Hence /c + 1 has a prime factorization. Thus we have established by 
mathematical induction that every integer greater than 1 has a prime 
factorization. 

To complete the theorem, we must establish uniqueness of fac¬ 
torization. Again we proceed by mathematical induction. Our table 
also tells us that the factorization of each n (n ^ 12) is unique. Assume 
that each integer m( 1 < m ^ k) has a unique prime factorization. 
Suppose that k 4- 1 has the two prime factorizations 

k + 1 = P 1 P 2 • • • Pu = PIP 2 • • • Pv , 

where p x ^ p 2 < . . . < p u and p/<p 2 '<,..< p Since pi divides 
fc-h 1, we see that pi divides p x p 2 . . . p u ; thus pi divides p { for some i, 
by Corollary 2-4. Since both pi and p x are primes, we conclude that 
Pi = Pi- 

Clearly, we may reverse the preceding argument to show that 
Pi = p/ for some j. Hence 


Pi = Ps ~ Pi> 

and 

Pi = Pi- Pi- 

Therefore, p x > p/ >p i; and so p x = pi. Thus (fc+ l)/pj is an integer 
not exceeding k , and 

/c “f" 1 

P 2 • • • Pu = —— =p 2 ' . . . p v '. 

Pi 

Hence, by the induction hypothesis, u = v, p 2 = p 2 , • . ., and p u ' = p u . 
Thus the fundamental theorem of arithmetic is established. ■ 
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EXERCISES 

1. Let E be the set of all positive even integers. Define m to 
be an “even prime” if m is even but is not factorable into 
two even numbers. Prove that some elements of E are 
not uniquely representable as products of even primes. 

2. Prove that every positive integer is uniquely represent¬ 
able as the product of a nonnegative power of 2 (perhaps 
2°) and an odd number. 

3. Suppose that a = p,p 2 ■ ■ ■ Ps is the unique factorization 
of a into primes ( Pi — P2 — • • • — Ps) ■ Prove that a has 
a unique representation 

n ei n e2 n er 

Q1 H2 • • • Hr ? 

where the < 7 * are primes, < 7 i < Q 2 < • * • < < 7 r> an d the e* 
are positive integers. 


4. Prove that, if a = qi'qf 2 ■ ■ ■ q/ r and b = s/'s 2 ft ■ ■ ■ sJ u 
are the factorizations of a and h into primes (see Exercise 
3), then there exist primes t x < t 2 < . • ■ < t v and non¬ 
negative integers g { and h ( such that 

a = ... t v B v, and b = t 1 hl t 2 h *. .. t v h ». 

5. Using the notation of Exercise 4, prove that 

g.c.d.(a,b) ~ t\ l t 2 2 . * . , 

where each c { is the smaller of the corresponding g, and 

h t . 

6. Use Exercise 5 to find 

(a) g.c.d.(121,66) (d) g.c.d.(2187,999) 

(b) g.c.d.( 169,273) (e) g.c.d.(64,81) 

(c) g.c.d.(51,187) (f) g.c.d .(p 2 q,pqr), where 

p, q, and r are primes. 

7. Using the notation of Exercise 4, prove that 

l.c.m.(a,fc) = t 1 n t 2 2 ■ ■ • t v lv , 

where each j { is the largest of the corresponding g t and h t . 
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8. Do Exercise 4 of Section 2-2, using Exercises 5 and 7 of 
this section. 

9. Use Exercise 7 to find 

(a) l.c.m.(125,150) (d) l.c.m.(253,506) 

(b) l.c.m.(132,154) (e) l.c.m.(lll,1221) 

(c) l.c.m.(39,143) (f) l.c.m.(p 2 q,pqr) , where 

p , q , and r are primes. 

10. For each finite set of integers {a,fc,c,. . . ,r}, we can 
define 

g.c.d.(a,h,c,...,r) 

to be the largest integer that divides each of a, b, c, . . ., 
and r. We can also define 

l.c.m. . . ,r) 

as the smallest integer that is divisible by each of a , h, c, 

. . ., and r. Find formulae for g.c.d.(a,h,c, . . .,r) and 
l.c.m.(a,h,c, . . . ,r) by generalizing the assertions in Ex¬ 
ercises 4, 5, and 7. 

11. Find g.c.d.(39,102,75) and l.c.m.(39,102,75). 

12. Prove that, if d x = g.c.d.(a,h), d 2 = g.c.d.(h,c), d 3 = 
g.c.d .(c,a), D = g.c.d.(a,b,c), and L = l.c.m.(a,fo,c), 






CHAPTER 3 


COMBINATORIAL AND 
COMPUTATIONAL NUMBER THEORY 


Much of number theory is concerned with the properties of 
primes. In Chapter 2 we saw that these numbers are the multiplica¬ 
tive building blocks of the integers. In Sections 3—2 and 3—3, we shall 
use combinatorial techniques to obtain two surprising results about 
primes. The combinatorial ideas underlying this approach will also 
be used in proving many of the theorems in later chapters. In the 
fourth section, we shall introduce one of number theory’s most useful 
tools, the generating function. To conclude the chapter, we shall dis¬ 
cuss the role of computers in number theory. 


3-1 PERMUTATIONS AND COMBINATIONS 

Although permutations and combinations are usually associated 
with probability theory, they are also relevant to number theory. For 
instance, let us consider a problem that faces a number theorist each 
time he visits a Chinese restaurant. 

Example 3-1: The Dinners for Two on a particular Chinese 
menu are presented as follows: 


Dinners for Two 

You may select one dish from each category. 

Category A Category B 

Fung Wong Guy Chicken Chow Mein 

Wor Hip Har Ho Yu Gai Poo 

Moo Goo Guy Pen 
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How many different Dinners for Two are available? Without any 
difficulty, we can list all the available dinners: 

Fung Wong Guy and Chicken Chow Mein, 

Fung Wong Guy and Ho Yu Gai Poo, 

Wor Hip Har and Chicken Chow Mein, 

Wor Hip Har and Ho Yu Gai Poo, 

Moo Goo Guy Pen and Chicken Chow Mein, and 
Moo Goo Guy Pen and Ho Yu Gai Poo. 

Of course, we may easily count the dinners without listing them. We 
have 3 choices in Category A, and, after we make a decision there, 
we have 2 choices in Category B. Thus, without looking at the list, 
we note that there are 


2 + 2 + 2 = 3 -2 = 6 

different dinners. 

The simple counting procedure employed in Example 3-1 is a 
particular instance of the following fundamental rule. 

General Combinatorial Principle: If an element a can 
be chosen from a prescribed set S in m different ways, and if there¬ 
after, a second element /3 can be chosen from a prescribed set T in n 
different ways, then the ordered pair (a,f) can be chosen in mn dif¬ 
ferent ways.* 

You may be wondering what all this can really have to do with 
number theory. The following examples lead us to expect that the 
product of any n consecutive positive integers is divisible by the 
product of the first n positive integers; though this assertion appears 
to have no direct relationship to combinatorial ideas, we shall see 
that the proof of it involves all the combinatorial concepts to be in¬ 
troduced in this section. 

Example 3-2: For n = 4, the product of the first four integers is 
1 • 2 ■ 3 • 4 = 24, and we observe that 5 • 6 • 7 • 8 = 1680 = 70 • 24; 
also 10 • 11 • 12 • 13 = 17160 = 715 • 24. 

Example 3-3: For n = 5, the product of the first 5 integers is 
1 • 2 • 3 • 4 * 5 = 120, and we observe that 4 • 5 • 6 • 7 • 8 = 6720 = 
56 • 120; also 11 • 12 • 13 * 14 • 15 = 360360 = 3003 • 120. 


*This principle is actually a theorem in the foundations of mathematics. See 
Theorem 10.4.12 in The Anatomy of Mathematics by Kershner and Wilcox. 
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Definition 3-1: An r-permutation of a set S of n objects is an 
ordered selection of r elements from S. 

Example 3-4: If S = {4,5,6}, then the 2-permutations of this set 
are (4,5), (5,4), (4,6), (6,4), (5,6), and (6,5); the 3-permutations of this 
set are (4,5,6), (4,6,5), (5,4,6), (5,6,4), (6,5,4), and (6,4,5). 

Theorem 3-1: If„P r denotes the number of r-permutations of a 
set of n objects , then 

n P r = n(n — 1) . . . (n — r -1-1). 

PROOF: We may make our first selection of an object in n dif¬ 
ferent ways. Once the first selection has been made, we may make the 
second selection in n — 1 ways from the remaining elements of the 
set. By mathematical induction, we see that we may make the ith 
selection in exactly n — (i — 1) ways. Thus the general combinatorial 
principle tells us that the r selections we are to make can be ex¬ 
ecuted in 


n(n — 1) (n — 2) . . . (n — r + 1) 
different ways. Thus we obtain the desired formula 

n P r —n(n — 1) . . . (n —r+1). H 

Of special interest to us is the symbol r P r , which we shall write as 
r! (read /factorial); we observe that l! — 1, 2! = 2, 3! = 6, 4! = 24, and 
so on. 

Definition 3-2: An r-combination of a set S of n objects is a 
subset of S having r elements . 

Example 3-5; If S = {4,5,6}, then the 2-combinations of S 
are {4,5}, {4,6}, and {5,6}. 

Note that r-permutations include all possible orderings and are 
therefore more numerous than r-combinations. 

Theorem 3-2: If j denotes the number of r-combinations 
taken from a set S of n elements , then 

/n\ _ n(n — 1) . . . (n — r + 1) 

W~ rl 
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PROOF: To each of the different r-combinations of S, we 
may give r P r different orderings. Consequently, 


r ? r ■ (") = n P r . (3-1-1) 

Thus, 

(n\ _ n ?r_ n(n- 1) . . . (n-r+1 ) 

\r) r Pr r\ " 

We are now able to prove the conjecture made near the beginning 
of this section. 


Theorem 3-3: The product of any n consecutive positive in¬ 
tegers is divisible by the product of the first n positive integers. 

Proof: The product of n consecutive integers, the largest of 
which is N, is precisely 


N(N-l) . . . (N — n + 1) = N P n . 


Furthermore, by equation (3-1-1), 

Since (^j is an integer, it follows that n! | N P n , where, of course, n! 
is the product of the first n positive integers. ■ 


The combinatorial technique we’ve used here is very powerful. 
To show that n \ \ N(N — 1) . . . (N — n+ 1), we established a com¬ 


binatorial interpretation of N(N— 1) . . . (N — n+ l)/n! (that is, 



since j is an integer, we were able to deduce the desired result 

In the next two sections we shall further illustrate this technique by 
using it in the proofs of two additional divisibility theorems. 


EXERCISES 

1. Prove that a set S of n elements has precisely 2 n subsets 
(the empty set and S itself are counted as subsets). 
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2. Use Exercise 1 to give a combinatorial proof of the formula 

*- i+ (j) + G) + (;) + - + 0- 

3. Using the definition of as the number of r-combina- 
tions of a set S of n elements, prove combinatorially that 

CHVMri 1 )- 

4. Give a new proof of Theorem 3-2, using the principle of 
mathematical induction in conjunction with Exercise 3. 

5. Prove that 

+ c: i ) + c: 2 ) + ... + (T)-( f+ ^ 1 )- 

[Hint: Use the principle of mathematical induction and 
the equation appearing in Exercise 3.] 

6. Prove that 

if 0<r=^n. [Hint: Use the principle of mathematical 
induction and the equation appearing in Exercise 3.] 

7. Prove that 

1 + © + © + ( 2 ) + • • • -2 "' 1 ' 

and that 



CM;) + (;M?) + 


8. Prove that 


C Hn-r)' 
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9. Prove that if p is a prime and 1 < a < p, then p 



10. Prove the binomial theorem: 

(x + y) n = x n + * M_ V + • • • 

[Hint: Use the principle of mathematical induction and 
the equation appearing in Exercise 3.] 

11. Prove that if p is a prime and n is a positive integer, then 
p | n p — n. [Hint: Use the mathematical induction on n, 
Exercise 9, and the binomial theorem with x — k and 

y = 1.] 


12. Suppose that the sequences of real numbers a 0 ,ai,a 2 ,a 3 ,. .. 
and b 0 ,b 1 ,b 2 ',b 3 , . . . satisfy the relation 

0.-2 (;K 

r = 0 N ' 


Then prove that 

(-1 Yb n = ± (”) (-1 Ya s . 

[Hint: Substitute the formula for a s into ^ (— l) s a s . ] 

13. Recall the definition of the Fibonacci numbers :F 1 = F 2 = 1, 
and F n = F n _j + F n _ 2 if n > 2. Prove that 

F - i+ (": 2 ) + (V) + - + (;ri) + rr 1 )' 

where j is the largest integer less than or equal to (n — l)/2. 

14. Prove that 


(l) (2) (3) +•• • + ( n ” 1) F n -i + F n — F 2 

where F n denotes the nth Fibonacci number. 
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3-2 FERMAT’S LITTLE THEOREM 

We shall use the combinatorial techniques of Section 3-1 to prove 
a result obtained by one of the founders of modern number theory, 
P. Fermat (1601-1665). 

Theorem 3-4: If p is a prime and n is a positive integer , then 
p | n p — n. 

Proof: Suppose that we wish to form strings of p colored beads 
each, and that we have on hand enough beads to permit unlimited 
use of each of n colors. How many different strings can we form? By 
the general combinatorial principle, the answer is clearly n p , since 
each bead may be chosen in exactly n ways and since there are p 
choices for each string. Figure 3-1 illustrates the case in which 
n — 3 and p = 3. 

Of the n p possibilities, exactly n strings have beads of only one 
color. Put these aside, and, in the manner illustrated in Figure 3-2, 
join the two ends of each of the remaining n p — n strings to form 
equally many bracelets. 



Figure 3-1 The twenty-seven strings of three beads with three possible colors. 
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We can alter any string of beads by removing a bead from the top 
end and placing it on the bottom end; this alteration produces a 
different string of beads without changing the resulting bracelet. 
(When n = 3 and p = 3, the 24 possible multicolored strings fall 
naturally into eight groups of three strings obtainable from each other 
by one or more repetitions of this alteration [see the first eight groups 
in Figure 3-2]; to each of these eight groups there corresponds a 
distinct bracelet [see Figure 3-3].) Let k be the least number of times 
this alteration may be repeated before the original color scheme of the 
string is reproduced. Certainly k > 1, since we have set aside all 
monocolor strings. Note that after 2k steps the bracelet’s original 
color scheme will again be reproduced; and, similarly, after 3k steps, 
4 k steps, and so on. By Euclid’s division lemma (Theorem 2-1), 
there exist h and r such that 

p = hk + r (0 ^ r < k). 

Since the color scheme is reproduced after hk steps and is also repro¬ 
duced after p steps (because after p steps all the beads are back in 
their original positions), it takes exactly r steps after the hkth step to 
get a reproduction of the color scheme. Since r< k and k is the least 




Figure 3-3 The eight bracelets of three beads with three possible colors 
(monocolor bracelets excluded). 
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positive number of steps required to obtain a reproduction, we see 
that r must be 0. Hence p = hk, and so k = p, since k > 1 and p is a 
prime. Consequently the n p — n strings fall naturally into groups of p 
strings each, and it is clear that each group gives rise to a separate 
bracelet. 

Thus the number of bracelets N multiplied by p gives the number 
of strings that are not monocolor, namely n p — n. Hence pN=n p —n; 
that is, p | n p — n. M 


EXERCISES 

1. Prove that if g.c.d.(n,p) = 1, then p \ n p ~ 1 — 1. 

2. Prove that 6 | n 2 — 1 if g.c.d.(n,6) = 1. 

3. Prove that n 5 and n both have the same last digit. 

4. In view of the fact that there are n! n-permutations of n 
colors, does the proof of Theorem 3-4 provide a proof that 
n! | n p — n? 

5. Prove that if 3~|~n, then 3 | n 2 — 1. 

6. Prove that if 5“|"(n — 1), n, and S - ! - (n +1), then 5 | (n 2 + 1). 
[Hint: Multiply together n, n — 1, n + 1, and n 2 + 1. ] 

3-3 WILSON'S THEOREM 

The theorem proved in this section was attributed to Sir John 
Wilson (1741-1793) by E. Waring in Meditationes Algebraicae 
(1770); however, G. W. Leibnitz appears to have discovered it 
before 1683. 

Theorem 3-5: If p is a prime , then p | [(p — 1)! + 1]. 

PROOF: If p = 2, the theorem is obvious; therefore we can assume 
that p is an odd prime. Consider p points on a circle distributed so that 
they divide the circle into p equal arcs. How many polygons can we 
form by joining these points (crossing of edges is allowed)? These 
polygons are called stellated p- gons, because their vertices are the 
vertices of a regular convex polygon with p sides. Recalling the 
general combinatorial principle, one might expect p P p (—p\) ways, 
since we may choose the first vertex in p ways, the second in (p —1) 
ways, and so forth; however, note that we can describe each of the 
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Figure 3-4 The twelve stellated pentagons. 


p-gons in 2p different ways, namely, by starting at any one of its p 
vertices and choosing one or the other of the two segments at that 
vertex as the initial segment. Therefore, we really obtain p!/2p dif¬ 
ferent p-gons. Figure 3-4 shows the twelve stellated pentagons. 

Of the p!/2p p-gons, exactly (p — l)/2 are unaltered when rotated 
through an angle of 2irIp radians; such unalterable p-gons are called 
regular stellated p-gons since they are “stars” of p points, with each 
point the vertex of a (2k + l) 7 r/p angle (0 < k < (p — l)/2). In the 
case p = 5, there are two such pentagons, shown in the third row of 
Figure 3-4; in the case p = 13, there are six unaltered 13-gons 
(triskaidecagons), as illustrated in Figure 3-5. The remaining 
p!/2p — (p — l)/2 stellated p-gons fall naturally into sets of p ele¬ 
ments; the members of each set can be obtained from a single member 
by successive rotations through 27r/p. The observation that there are p 
elements in each set may be verified as in the proof of Fermat’s little 
theorem, where we showed that each bracelet arose from p strings of 
beads. When p = 5, there are two such sets (they constitute the first 
and second rows of Figure 3-4). 

Thus, the total number of sets is 

p! _ p — 1 

2 P 2 _ (p — 1)! — (p-1) 

2 p 


V 
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Figure 3-5 The six regular stellated 13-gons. 


Hence 2p \ [(p — 1)! — p + 1]; consequently, p\{p — 1)!+1, as 
desired. ® 


EXERCISES 

1. Prove that p is the smallest prime that divides (p — 1)! + 1. 

2. Prove that 10")” [(n — 1)! + 1], for each n. 

3-4 GENERATING FUNCTIONS 

This section is devoted to a technique with numerous applica¬ 
tions throughout combinatorial analysis and the theory of numbers. 

Definition 3-3: If A = {a n } n = b = {a b , a b+1 ,a b+2 , ■ ■ •} is a se¬ 
quence of real numbers, then 

f A (x) = a n x n 

n = b 


is called the generating function for the sequence A. 
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The function f A (x) is defined for all values of x for which the 
series converges. These admissible values of x will vary from 
sequence to sequence. In most cases the lower index b will be either 
0 or 1. 

Example 3-6: If a n = 1 for all n ^ 0, then by the formula for the 
sum of an infinite geometric series, 

Ia(x) = 2 X ” = TTT7> 1*1 <1- 

tt= 0 IX 

(The second equality above follows if we let n—» 00 in Theorem 1-2.) 
Example 3-7: If a n = 1/n! for all n ^ 0, then 


f A (x) = 2 x n /u \ = e x ; 

n = 0 

this series converges for all real values of x. 

To illustrate the use of generating functions, we shall solve the 
following rather difficult problem: 

Let a n denote the number of ways of associating multiplicands 
with parentheses in an (ordered) product of n numbers so that the 
resulting expression correctly defines a multiplication of the numbers. 
What is a simple expression for a n ? 

Now a t = a 2 = 1. Since three numbers may be associated in two 
ways, namely, b l (b 2 b 3 ) and ( b 1 b 2 )b 3 , we see that a 3 = 2. Further¬ 
more, a 4 — 5, since four numbers may be associated as follows: 
((bib 2 )b 3 )b 4 , {bi(b 2 b 3 ))b 4y {b x b 2 )(b 3 b 4 ), b 1 (b 2 (b 3 b 4 )), b^b^^b^. 
The following relationship holds among the a { : 

a n + i ~ d\a n + 02#n-i + 030 n —2 + • • • + a n a x . (3-4-1) 

The ith term in this sum, afa n _ i + 1 , is the number of associations of 
n + 1 terms in which the two outermost bracketings contain i terms 
and n — i + 1 terms, respectively; there are a* ways of associating the 
terms inside the first set of brackets, and a n _ i+1 ways of associating 
those inside the second set. The total number of bracketings is the 
sum of the number of bracketings with i terms in the first set of outer¬ 
most brackets for 1 < i < n. 

Our objective now is to prove the following result. 

Theorem 3-6: a n = ( 2n ~ 2 
\ n — 1 
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PROOF: We start with the defining formula 
IaM = J) a n x n . 

n= 1 


Then, by relation (3-4-1), 


/.ilx) = x + ^ (o,a„-, + . . . + a n - 1 a 1 )x n 

71 = 2 


= *+(i a n xA ( 2 a n xA 

\n =l / \n=l ' 




(the penultimate equation follows directly from the formula for the 
multiplication of power series). Thus f A (x) satisfies the quadratic 
equation y 2 — y + x = 0. 


1 (1 — 4x) 112 

Hence f A (x) is one of the two functions - 2 - an< ^ 


i n — 1/2 

- — -=—-—. Since/^(0) =0, we see that f A (x) is in fact the second 

Z Z 

of these. Thus 




(3-4-2) 


We now expand (1 — 4x) 1/2 into a binomial series: 

/ 1 \ 


/„(*)=!“! % (-D-I2 14-*- 


where 


i\ . . ik\ 


2 I = 1 and | 2 | = 

VO/ 


n! 


Consequently, 


fA(x)=~ 2 (-D M 2 4”*". 


w 


(3-4-3) 


In passing we note that the series for f A (x) converges for | x \ < 1/4; 
this may be proved by means of the ratio test. 
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Since a function has at most one MacLaurin series expansion, the 
coefficients of the series in equation (3-4-3) must be a n . (Recall that 
a MacLaurin series expansion is a Taylor series expansion about the 
origin x = 0.) Hence 


a n = —^ (-D" 2 4" 


= (-D 


-.Kl-O-d-■») - 


n! 


- r_n«2 1 • (-1X1-2) ■ . . (1 —2(n — 1))2" 
L) ° n! 


11-3... (2n — 3)2" 

2 n! 

1 1 -3 . . . (2n-3) • n!2" 

2 n! • n! 


1 1 -2 -3 -4 . . . (2n-4)(2n-3)(2n-2)2n 

2 (n!) 2 

1 1 -2 -3 -4 . . . (2n — 2) 
n [(n — 1) !] 2 

/2n —2\ / 

U-i //”• 


If you attempted to guess the form of a n before examining 
Theorem 3-6, you undoubtedly had difficulty. Even if you had dis¬ 
covered the correct form, you would still have been faced with the 
arduous task of proving that it was the right form. However, with the 
aid of generating functions, we were able to establish easily the 
formula in Theorem 3-6. 


EXERCISES 

1. Prove that x/(l — x) 2 is the generating function for the 
sequence aj = 1, a 2 = 2, . .., a n = n, .... 


2. Prove that —log (1 — x) is the generating function for the 
sequence a x = 1, a 2 = 1/2, . .., a H — 1/n,_ 


3. Let 


M!D(oH^)(rw 2 )(?) 


+ 


+ 
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(t))(ff)' ^sing *^ ie binomial theorem, prove that 
(l + 3 c) JV (l + 3 c) Jtf is the generating function for d n . Con¬ 
clude thatd n = ( N+ n M )- 

4. Prove that 1/(1 — x) 3 is the generating function for the 
sequence of triangular numbers a 0 — 1, Oi = 3, . . . ,a„ = 
(n + 2\ 


5. Prove that x/( 1—x — x 2 ) is the generating function for the 
sequence of Fibonacci numbers F t = 1, F 2 = 1, . • •, F n = 

oo 

F n _! + F„_ 2 ,_[Hint: Prove that if/ F (x) = ^ tb en 

(1 - X - x 2 )f F (x) = x.] 

6. Prove that 1/(1 — 2%) is the generating function for the 
sequence a 0 = 1, ai = 2, . . a n = 2", . . .. 

7. Prove that (1 _ b x )(l - bx - x) is the generating function 
for the sequence a 1 = 1, a 2 — 2b + 1, . . ., a n = (b+ 1)" — 

b n , . . .. 

8. What is the generating function for the sequence defined 
by o 0 = 1 and a n = 2 for n > 0? 


3-5 THE USE OF COMPUTERS IN 
NUMBER THEORY 

This section is not intended as an introduction to programming. 
Rather, it is intended to suggest the importance of computation in 
number theory. 

The computer has been used to prove a variety of theorems in 
number theory; for example, Donald Gillies used two hours and 
fifteen minutes on the Illiac II at the University of Illinois to establish 
that 2 ll213 -l, a number of 3376 digits, is a prime. The University of 
Illinois was proud enough of this achievement to put 2 11213 -1 IS 
PRIME” on its postage meter, so that it appeared on all outgoing mail. 
However, the computer’s most important role has been to provide 
numerical data from which one may guess the truth of a theorem. 

At the end of this section are a number of projects requiring only 
a modest knowledge of computer programming. Indeed, many of 
them are so simple that an energetic student with a desk calculator 
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(or with several sharp pencils and a pot of coffee) may complete them. 

To illustrate a typical project, along with conclusions that might 
be drawn, let us work through the following problem: 

A perfect number is a number equal to the sum of its proper 
divisors (that is, all divisors except the number itself); find all perfect 
numbers n(2 ^ n ^ 500). 

To proceed, we need only write a program that determines for 
each n all proper divisors d x , d 2 , . . ., d r of n, and then prints out n 

r 

whenever ^ d t = n. 

i=i 

The diagram in Figure 3-6 indicates the steps a computer could 
follow in finding all perfect numbers not exceeding 500; such a 
diagram is called a flow diagram or flow chart . 

One may instead use a reference such as the C.R.C. Standard 
Mathematical Tables and a desk calculator to determine quickly the 
sum of the proper divisors of n. The perfect numbers not exceeding 
500 are 6, 28, and 496. 

It is easy to formulate conjectures about these numbers, even 
from these meager data. Note that 6 = 2 • 3, 28 = 2 2 • 7,496 = 2 4 • 31. 
Thus, in each instance, the perfect number is the product of a power 
of 2 and a single odd prime; in particular, each perfect number not 
exceeding 500 is of the form 2 P ~ 1 (2 P - 1), where 2 P -1 is an odd 
prime. 

Conjecture Is 2 p ~ 1 (2 p — 1) is a perfect number whenever 
2 P — 1 is a prime. 

Conjecture 2: All perfect numbers are of the form given in 
Conjecture 1. 

We can easily prove Conjecture 1. Suppose 2 P — 1 is a prime. 
Then, the proper divisors of 2 P ~ 1 (2 P — 1) are 1, 2, 4,. . ., 2 P ~\ 2 P — 1 , 
2(2 P — 1), . . ., and 2 P ~ 2 (2 P — 1). The sum of these numbers is 


2 2i + S' (2 p -l)2 j = 2 p -l+ (2 p -l)(2 p - 1 -l)=2 p ~ 1 (2 p -l). 

j=0 j =0 

Conjecture 2 was raised by the ancient Greeks, and it is still not 
completely solved. L. Euler proved that every even perfect number is 
of the form given in Conjecture 1; however, it is not known whether 
or not any odd perfect numbers exist. We know only that none exist 
that are less than 10 36 (this result was proven in 1967). 

Any student of number theory who wishes to obtain an adequate 
knowledge of computer programming should consult one of the 
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many excellent elementary texts, for example, Basic Fortran IV Pro¬ 
gramming by Healy and Debruzzi (a programmed self-instructional 
text), Introduction to Fortran IV Programming by Blatt, or A Guide to 
Fortran Programming by McCracken. 


EXERCISES 


The part of the text to which each computer program 
is relevant is listed at the end of the corresponding exer¬ 
cise. You are encouraged to form conjectures from the 
data you obtain with your programs. 

1. Find the number d(n) of divisors of n for 1 ^ n ^ 200. 
(Chapters 2, 6, and 15) 

2. Find all the primes smaller than 1000. (Chapters 2 and 8) 

3. Find all integers smaller than 1000 that are either perfect 
squares or sums of two perfect squares. (Chapter 11) 

4. Find all integers smaller than 1000 that are either perfect 
squares or sums of two or three perfect squares. (Chapter 

ID 

5. Find all integers smaller than 1000 that are either perfect 
squares or sums of two, three, or four perfect squares. 
(Chapter 11) 

6. A number n is called a quadratic residue modulo p 
(where p is a prime) if p“|~n, and if there exists an integer 
m(l < m < p) such that p \ m 2 — n. For p = 11, find all 
numbers r(l ^ r < p) that are quadratic residues modulo 
p. Do the same for p = 3, 5, 7, 17, 29, and 61. (Chapter 9) 

7. Using the results of Exercise 6, find the number of con¬ 
secutive pairs of quadratic residues modulo p for p = 3, 5, 
7, 11, 17, 29, and 61. Compare your answer with p/4 in 
each case. (Chapter 10) 

8. Find the number of ways an integer n can be represented 
as a sum of distinct positive integers for n ^ 30 (for ex¬ 
ample, 5 can be represented in three ways: 5, 4 + 1, and 
3 + 2). (Chapter 12) 
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9. Find the number of ways an integer n (n ^30) can be 
represented as a sum of odd positive integers (for example, 
5 can be represented in three ways: 5, 3 + 1 + 1, and 
l + l + l + l + l). Compare your results with those ob¬ 
tained for Exercise 8. (Chapter 12) 

10. Find the sum o-(n) of the divisors of n for 1 ^ n ^ 200. 
(Chapter 6) 

11. Let denote the number of positive integers not 

exceeding n that are relatively prime to n; find <\>{n) for 
1 < n ^ 100. (Chapter 6) 

12. Let b (n) denote the number of ways of representing n as 
a sum of nonnegative powers of 2 (for example, b( 4) =4, 
since 4 has the representations 4, 2 + 2, 2 + 1 + 1, 
1 + 1 + 1 + 1); find b(n) for l < n ^ 40. Are there any 
significant relationships between b(2n + 1), b(2n), b(n), 
and b(n — 1)? (Chapter 13) 

13. For 1 < n ^ 15, compute the number t{n) of representa¬ 
tions of n as a sum of positive integers that are not multi¬ 
ples of 3 (for example, t( 4) =4, since 4 has the repre¬ 
sentations 4, 2 +2, 2 + 1 + 1, 1 + 1 + 1 +1). (Chapter 12) 

14. For 1 < n ^ 15, compute the number v(n) of representa¬ 
tions of n as a sum of positive integers in which no sum¬ 
mand appears more than twice (for example, v(4) =4, 
since 4 has the representations 4,3 + 1,2 + 2,24-1 + 1). 
Compare v{n) with t{n). (Chapter 12) 





CHAPTER 4 


FUNDAMENTALS OF CONGRUENCES 


In Chapter 3, we selected the proofs of Fermat’s little theorem 
and Wilson’s theorem for illumination rather than for brevity. More 
succinct proofs of these theorems are possible with congruence nota¬ 
tion, first introduced by Gauss. In this chapter and in Chapter 5, we 
shall investigate the many important properties of congruences. 


4-1 BASIC PROPERTIES OF CONGRUENCES 

Definition 4-1: If c # 0, we say that a = b (mod c) (read “a is 
congruent to b modulo c”) provided that (a —b)/c is an integer (that 
is, provided that c \ a — b). 

Example 4-1: Since (8 —2)/2 = 3 is an integer, we see that 
8 = 2 (mod 2); but 8^3 (mod 2), because (8 — 3)/2 = 2.5 is not an 
integer. Similarly, 17 = 12 (mod 5), 100 = —40 (mod 20), and 11 = —1 
(mod 12). 

Example 4-2: We can now express Fermat’s little theorem in 
congruence notation: 


n p = n (mod p); 

similarly, we can state Wilson’s theorem in the form 
(p — 1)! = —1 (mod p). 

Example 4-3: If c ^ 0 and a= b, then a = b (mod c). Thus, 
equal integers are congruent modulo any nonzero integer; however, 
congruent integers are not necessarily equal. 

If (a — b)/c is an integer, so is (a — b)/(—c) = —{a — b)/c ; hence 
a = b (mod c) if and only if a = b (mod —c). Thus congruences with 
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negative moduli may be replaced by equivalent congruences with 
positive moduli. Throughout this chapter we shall assume that all 
moduli are positive. 

The relation “is congruent to” is an equivalence relation. For 
those who are unfamiliar with equivalence relations, we shall now 
prove that the congruence relation has all the defining properties of 
an equivalence relation, 

Theorem 4-1: If a , b, c, and d are any integers (c # 0), the 
following assertions hold: 


a ss a (mod c); (4-1 -1) 

if a ss b (mod c), then b = a (mod c ); and (4-1 -2) 

if a — b (mod c) and b = d (mod c), then a = d (mod c). (4-1 -3) 

Remark: Statements (4-1-1), (4-1-2), and (4-1-3) are the 
reflexive, symmetric, and the transitive properties of congruences, 
respectively. 

PROOF: (a~a)lc = 0 is an integer; thus (4-1-1) holds. If 
(a - b)lc is an integer, then (b - a)/c = -(a~ b)/c is also an integer; 
thus (4-1-2) is true. Finally, if (a - b)jc and (b - d)lc are integers, 
then so is (a-d)lc, because it is the sum of the first two. Thus 
(4-1-3) is confirmed. ® 

The next theorem shows that congruences may be validly added, 
subtracted and multiplied. 

Theorem 4-2: Suppose a = a' (mod c) and b = h f (mod c); 
then 


a±b ss a' ± b' (mod c), and 
ab = a'b' (mod c). 


(4-1-4) 

(4-1-5) 


Proof: Since (a-a’)lc and (b-b')lc are integers, so are 
(a±b) — ( a' ± b') _ a — a' (b — b') an( j ab — a'b’ _ ^ b — b[ + 
c c ~ c c c 


Example 4-4: Since 


19 = 11 (mod 4) 


(4-1-6) 
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and 


6 = 2 (mod 4), 


(4-1-7) 


we see that 

25 = 13 (mod 4) (by addition of (4-1-6) and (4-1-7)), 

13 = 9 (mod 4) (by subtraction of (4-1-7) from (4-1-6)), 

114 = 22 (mod 4) (by multiplication of (4-1-6) and (4-1-7)). 

Note that addition, subtraction, and multiplication of two congruences 
are permissible only when the congruences have the same moduli. 

Division of both sides of a congruence by an integer is not always 
permissible; for example, in the congruence 14 = 4 (mod 10), it is 
not valid to divide both sides by 2, because 7^2 (mod 10). The 
following theorem provides conditions under which division is 
permissible. 

Theorem 4-3 (Cancellation Law): If bd = bd ' (mod c ) and if 
g.c.d.(fo,c) = 1, then d = d' (mod c ). 

PROOF: Since (bd — bd')/c is an integer, c \ b(d~d'). Thus, 
by Theorem 2-3, c \ d — d’, that is, d = d' (mod c). ■ 

Example 4-5: Since 6 = 12 (mod 2) and g.c.d.(2,3) = 1, we can 
conclude that 2 = 4 (mod 2); however, we cannot conclude that 
3 = 6 (mod 2), since g.c.d.(2,2) = 2 ^ 1. 

We have now established procedures for performing algebraic 
operations on congruences much like the customary operations on 
ordinary equations. 


EXERCISES 

1. Find integers x such that 

(a) 5x = 4 (mod 3) 

(b) 7x = 6 (mod 5) 

(c) 9x = 8 (mod 7). 

2. Do there exist integers x such that 

(a) 6x = 5 (mod 4) 

(b) lOx = 8 (mod 6) 

(c) 12x = 9 (mod 6)? 
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3. Prove that if x = y (modm) and a 0 , a u . . a r are integers, 
then a 0 x r + a 1 x r_1 + . . . + a r = a 0 y r + a x y r ~ l + . . . + a r 
(mod m). 

4. Prove that if bd = bd' (mod p), where p is a prime and 
pfb, then d = d' (mod p). 

5. Prove that if | a | < k/2, \b \ < fc/2, and a = b (mod k), 
then a = b. 

6. Is there a perfect square n 2 such that 

n 2 = — 1 (mod p) (4-1 -8) 

for p= 3? for p=5? for p= 7? for p= 11? for p=13? for 
p = 17? for p = 19? Can you characterize the primes p for 
which congruence (4-1-8) has a solution? 

7. Which of the following congruences hold? 

(a) 12,345,678,987,654,321 = 0 (mod 12,345,678) 

(b) 12,345,678,987,654,321 = 0 (mod 12,345,679) 

(c) 57 = 208 (mod 4) 

(d) 531 - 1236 (mod 7561) 

(e) 12321 = 111 (mod 3). 

4-2 RESIDUE SYSTEMS 

Theorem 4-2 assures us that, in a congruence involving only 
addition, subtraction, and multiplication, we may validly replace 
integers with congruent integers. We shall now study some important 
concepts that will facilitate making these replacements. 

Definition 4-2: If h andj are two integers and h — j (mod m), 
then we say that j is a residue of h modulo m. 

Definition 4-3: The set of integers { r,, r 2 , ■ ■ •, r s } is called a 
complete residue system modulo m if 

(i) Tj ^ rj (mod m) whenever i ^ j; 

(ii) for each integer n there corresponds an r t such that n = r t 
(mod m). 
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Theorem 4-4: If s different integers r l9 r 2 , . . ., r s form a com¬ 
plete residue system modulo m , then s — m. 

Proof: Let t 0 = 0, t t = 1, t 2 = 2, . . ., and t m - l = m — 1. Then 
the m integers t 0 * *m-i form a complete residue system 

modulo m. To see this, note that for each n there exist unique integers 
q and u such that 

n — mq + u and 0 < u < m 

(by Euclid's division lemma). Hence n = u (mod ra), and u is one of 
the ti . Furthermore, since | — t j | ^m — 1, no two of the t t are 

congruent modulo m. Consequently, we see that the set {t 0 ,t i, . . ., 
tm-i} is a complete residue system modulo m. 

Thus, each r t is congruent to exactly one of the ti (so that s ^ m). 
Conversely, since the r { form a complete residue system, every t t is 
congruent to exactly one of the r t (so that m ^ s). Hence s = m. ■ 

Corollary 4-1: Let m be a positive integer; then {0,1,2,, 
m — 1} is a complete residue system modulo m. 

Example 4~6: The sets {1,2,3}, {0,1,2}, {—1,0,1}, and {1,5,9} 
are all complete residue systems modulo 3. 

When working with congruences modulo m, we can replace the 
integers in the congruences by elements of {0,1,2, . . .,ra— 1}. Such 
substitutions often lead to the simplification of seemingly compli¬ 
cated problems. 

Example 4-7: Find an integer n that satisfies the congruence 
325n - 11 (mod 3). (4-2-1) 

Since 

325 = 1 (mod 3) 

and 

11 = 2 (mod 3), 

we must find an integer n such that 

n = 2 (mod 3). 

The answer is now obvious; the integer 2 will do quite nicely. 
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In our study of congruences modulo m, we shall often be in¬ 
terested in the integers that are relatively prime to m. The elements 
of a complete residue system that are relatively prime to m form what 
is called a reduced residue system modulo m. 

Definition 4-4: The set of integers {r l9 r 2 , . . . ,r s ) is called a 
reduced residue system modulo m if 

(i) g.c.d.^m) = 1 for each i; 

(ii) r { 4 s r 5 (mod m) whenever i ^ j; and 

(iii) for each integer n relatively prime to m there corresponds an 
r { such that n = r t (mod m). 

Example 4-8: The set {0,1,2,3,4,5} is acomplete residue system 
modulo 6; consequently {1,5} is a reduced residue system modulo 6. 
In general, we can obtain a reduced residue system from a complete 
residue system by deleting those elements of the complete residue 
system that are not relatively prime to m. 

Example 4-9: Let p denote a prime; then {0,1,2,... ,p — 1} is 
a complete residue system modulo p. The only element of this set 
not relatively prime to p is 0; hence {1,2,.. .,p —1} is a reduced 
residue system modulo p. In general, if p is a prime, we can obtain a 
reduced residue system modulo p by deleting the one element in a 
complete residue system that is not divisible by p. Thus, we see that 
a reduced residue system modulo p has p— 1 elements. 

Definition 4-5: The function (f>(m) shall denote the number 
of positive integers less than or equal to m that are relatively prime 
to m. This function <f>(m) is called the Euler <f> function. 

Theorem 4-5: If s integers r t ,r 2 ,... ,r s form a reduced residue 
system modulo m, then s = <f>(m). 

PROOF: This proof is similar to that of Theorem 4-4, except 
that now we let t l9 t 2 ,. . t 0m) denote the ${m) positive integers 
not exceeding m that are relatively prime to m. For each integer n 
relatively prime to m, Euclid’s division lemma guarantees the 
existence of integers q and u such that, 

n = qm H- u and 0 ^ u < m. 

If g.c.d.(u,m) = d, then d \ u and d | m; thus d | n, and sod | g.c.d.(n,m). 
However, since n is relatively prime to m, g.c.d.(n,m) =1; conse¬ 
quently, <2—1, and we see that u and m are relatively prime. Hence, 
u is one of the f*. Since | t t — tj \ ^ m — 1, no two of the t t are con- 
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gruent modulo m; thus, the t t form a reduced residue system 
modulo m. 

Therefore, every r t is congruent to exactly one t t ; thus s ^ <f)(m). 
Conversely, since the r* form a reduced residue system, each t t is 
congruent to precisely one of the r*; thus Hence s = ■ 

The study of residue systems belongs both to number theory and 
to algebra. Students with a knowledge of modem algebra will recog¬ 
nize that the complete residue system constitutes a finite ring 
[fi tj is defined as the element r h such that r h se r i + r 5 (mod m); 
r t fj is that r h for which r h = (mod m)]. The reduced residue 
system is simply the group of units in this finite ring. Since we are 
concerned with the properties of integers rather than with the 
algebraic systems that may be constructed from them, we refer stu¬ 
dents interested in the algebraic approach to Chapter 1 of A Survey 
of Modern Algebra by G. Birkhoff and S. MacLane. 

EXERCISES 

1. Which of the following are complete residue systems 
modulo 11? 

(a) 0, 1, 2, 4, 8, 16, 32, 64, 128, 256, 512 

(b) 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21 

(c) 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22 

(d) -5, -4, -3, -2, -1, 0, 1, 2, 3, 4, 5. 

2. Which of the following are reduced residue systems 
modulo 18? 

(a) 1, 5, 25, 125, 625, 3125 

(b) 5, 11, 17, 23, 29, 35 

(c) 1, 25, 49, 121, 169, 289 

(d) 1, 5, 7, 11, 13, 17. 

3. Suppose {a t , a 2 , . . a k } is a complete residue system 
modulo k , where k is a prime. Prove that for each integer n 
and each nonnegative integer s there exists a congruence of 
the form 


n s ^ bjk j (mod k s+1 ) 

j=o 


where each b 3 is one of the a*. 
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4. Let w(n) denote the number of primes not exceeding n 
that do not divide n. Is w(n) < ? Can you find values 

of n for which w(n) = <j)(n) — 1? 

4-3 RIFFLING 

In this section we shall demonstrate the usefulness of congru¬ 
ences in solving a problem of frequent interest to college students. 

Take an ordinary deck of cards arranged in any order. How many 
shuffles are required before the deck returns to its original order? Of 
course, to idealize the real life situation, we must stipulate that only 
the modified perfect faro shuffle is permitted, as follows: Cut the deck 
into two equal 26-card packs; then proceed by alternating cards from 
each pack. The cards formerly in positions 1,2, .. .,26 are moved to 
positions 2,4,... ,52; and the cards formerly in positions 27,28,... ,52 
are moved to positions 1,3,... ,51. Thus, if a card starts in position x , 
it will end in position t/, where 1 ^ y ^ 52 and 2 x = y (mod 53). 
After n shuffles, the card will be in position u>, where 1 < w ^ 52 
and 2 n x = w (mod 53). 

To determine the number of necessary shuffles, we must find n 
such that 2 n x = x (mod 53) for every x such that 1 < x ^ 52. Since 
53 is a prime, we may cancel x from both sides of the congruence; 
thus we must solve the congruence 

2 n = 1 (mod 53). (4-3-1) 

By Fermat’s little theorem, we know that 

2 53 ^ 2 (mod 53). (4-3-2) 

Cancelling 2 from both sides of congruence (4-3-2), we find that 

2 52 = 1 (mod 53). (4-3-3) 

Hence, the cards will return to their original order after 52 shuffles. 
Actually, 52 is the least number of shuffles required, but we shall not 
prove this here. 

In general, if we have a deck of m cards, then n shuffles will return 
the cards to the original order provided that 

2 n = 1 (mod m + 1). (4-3-4) 


Thus, if m — 62, we need only 6 shuffles, since 2 6 — 1 — 63. 
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EXERCISES 

1. How many modified perfect faro shuffles are needed to 
return the cards to their original position in a deck of 6 
cards? of 8 cards? of 12 cards? 

2. Suppose that instead of performing a modified perfect faro 
shuffle as described in this section, we shuffle as follows: 
Take the bottom and top cards of the deck and place them 
on the table to start a new deck. Then take the remaining 
bottom and top cards and place them on the newly started 
pile. Continue this process until all cards are gone from 
the original pack. For example, if the deck has six cards, 
then we shuffle as shown in Figure 4-1. 


3~ I 

4~ I 

2 1 

5 1 

1 1 

~ i 

Figure 4-1 Shuffle of a deck of six cards described in Exercise 2 (side view). 

Prove that the cards in a deck of 2 n cards will return to their 
original positions after n -hi such shuffles. (Note that the 
shuffle described in this problem is mechanically some¬ 
what easier to perform than the modified perfect faro 
shuffle described earlier.) 


1 


2 


3_ 

I 

T 


6 


















CHAPTER 5 


SOLVING CONGRUENCES 


In Chapter 2, we studied the problem of finding integers x and y 
that satisfy the linear Diophantine equation 

ax + by = c. 

We can restate the problem as follows: For what values of x is 
(ax — c)lb an integer? In other words, what are the values of x for 
which 

ax ^ c (mod b)? 

In the first section of this chapter, we shall translate the theorem on 
the linear Diophantine equation (Theorem 2-4) into the language of 
congruences. 

We are often interested in solving several congruences simultane¬ 
ously. Such a problem arose relatively early in the history of mathe¬ 
matics. In the first century A.D., Sun-Tsu asked: What number yields 
the remainders 2, 3, and 2 when divided by 3, 5, and 7, respectively? 
In terms of congruences, Sun-Tsu was asking for an integer n that 
satisfies the three conditions 

n = 2 (mod 3), 
n = 3 (mod 5), 


and 


n = 2 (mod 7). 


In Section 5-3, we shall learn how to solve Sun-Tsu’s problem. 


5-1 LINEAR CONGRUENCES 

The simplest congruence problem may be phrased as follows: 
If a, b, and c are integers, can we find an integer n such that 
58 
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an = b (mod c)? (5-1 -1) 

Note that if n does satisfy this congruence, then all integers of the 
form n + kc are also solutions, because 

a(n + kc) = an + akc = an = b (mod c). 

Example 5-1: If a = 5, b = 3, and c = 8 , we see that 7 is a solu¬ 
tion of the congruence 


5 n ss 3 (mod 8 ). 


Indeed, each of the numbers . . ., —17, —9, — 1 , 7, 15, 23, 31, ... is a 
solution. 

So we see that if we can find a solution n of congruence (5-1-1), 
we can then construct infinitely many others. However, all the 
solutions we have constructed so far are congruent to n modulo c. 
This leads us to ask: How many mutually incongruent solutions of 
congruence (5-1-1) are there? 

To answer our two questions about congruence (5-1-1), we use 
the knowledge of the linear Diophantine equation we obtained in 
Theorem 2-4. Solving (5-1-1) is equivalent to finding integers n and k 
such that an — b = kc , that is, 

an+ (— c)k = b . ( 5 - 1 - 2 ) 

Now, by Theorem 2-4, integers n and k satisfying equation (5-1-2) 
exist if and only if d|&, where d= g.c.d.(a,c). Thus (5-1-1) has a 
solution n if and only if d\b , where d~ g.c.d. (a,c). 

To answer our second question, we note that, by Theorem 2-4, 
each solution of (5-1-2) has the form 

n = n 0 + ct/d k = k 0 + at/d , ( 5-1 - 3 ) 

where n 0 and k 0 constitute one specific solution, t is any integer, and 
d = g.c.d. ( a,c ). Among the different values of n described by (5-1-3), 
we note that the d values n 0 , n 0 + c/d , n 0 + 2 c/d, . .., n 0 + (d- l)c/d 
are all mutually incongruent modulo c , because the absolute dif¬ 
ference between any two of them is less than c. If n = n 0 + ct/d is 
any other value, we write t = qd + r, where 0 ^ r < d, and we see 
that 

n = n 0 4- c{qd + r)/d 
= n 0 + cq + cr/d 
= n 0 4- cr/d (mod c). 
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Thus every solution of (5-1-2) (and therefore, of (5-1-1)) is congruent 
to one of the d values of n given above. Hence there exist d= g.c.d.(a,c) 
mutually incongruent solutions of (5-1-1), if there exist any. 

We now state our knowledge of the congruence (5-1-1) as a 
theorem. 

Theorem 5-1: If d = g.c.d.(a,c), then the congruence 
an — b (mod c) 

has no solution if dtb, and it has d mutually incongruent solutions 
ifd\b. 

Example 5-2: Since g.c.d.(5,8) = 1, the congruence in Example 
5-1 has a solution (as we already know), and all solutions of it are 
mutually congruent. 

Example 5-3: Since g.c.d.(21,3) = 3 and 3+11, there exist no 
solutions of the congruence 

21x = 11 (mod 3). 

Example 5-4: Since g.c.d.(15,12) = 3 and 3 | 9, the congruence 
15x = 9 (mod 12) 

has exactly 3 mutually incongruent solutions. By inspection, we see 
that x = 3 is a solution. By Theorem 2-4, all solutions are therefore 
given by 

x = 3 + t^ = 3 + 4t t=. . ., -2,-1,0,1,2, . ... 

By letting t = 0,1,2, we obtain 3 mutually incongruent solutions of 
the congruence: 3, 7, and 11. 

We shall apply Theorem 5-1 to the particular case b = 1 after 
introducing two new terms. 

Definition 5-1: We say that a solution n of a congruence 
(such as (5-1-1)) is unique modulo c if any solution n' of it is con¬ 
gruent to n modulo c. 

Definition 5—2: If ad = 1 (mod c ), we say that a is the inverse 
of a modulo c. 

Corollary 5-1: If g.c.d.(a,c) = 1, then a has an inverse , and 
it is unique modulo c. 
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Proof: If g.c.d. (a,c) = 1, Theorem 5-1 implies that 
an = 1 (mod c) 

has a solution n, and that it is unique modulo c. ■ 

Example 5-5; Since 5 2 = 25 = 1 (mod 8), we see that 5 is its 
own inverse modulo 8. We observe that—3 and 13 are also inverses of 
5 modulo 8; this is not inconsistent with the fact that 5 has a unique 
inverse modulo 8, because —3 = 5 = 13 (mod 8). 

EXERCISES 

1. Find a complete set of mutually incongruent solutions of 
each of the following. 

(a) 7x = 5 (mod 11) 

(b) 8x s 10 (mod 30) 

(c) 9x = 12 (mod 15). 

2. Which of the following congruences have solutions? 

(a) 99x = 100 (mod 101) 

(b) 400898x = 22 (mod 400900) 

(c) 27x = 1 (mod 51) 

(d) 99x = 100 (mod 102) 

(e) 30x = 42 (mod 49) 

(f) 81x = 57 (mod 117). 

3. Find a, the inverse of a modulo c, when 


(a) a = 2 

and 

c — 5 

I'¬ 

ll 

o 

3 

and 

c— 9 

(c) a =12 

and 

c — 17. 


5-2 THE THEOREMS OF FERMAT AND 
WILSON REVISITED 

Our knowledge of congruences leads to proofs of Fermat’s little 
theorem (Theorem 3-4) and Wilson’s theorem (Theorem 3-5) that 
are less picturesque than the proofs using a combinatorial approach, 
but that provide more general results. 
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Theorem 5-2 (Eulers Theorem): If g.c.d. (a,m) = 1, then 
a 0{m) = 1 (mod ra). 

PROOF: Let r l9 r 2 , . .., r Mm) be a reduced residue system 
modulo m. We note that ar l9 ar 29 .. ., ar * (m) are all relatively prime to 
m; furthermore, they are mutually incongruent, since ar* = ar 5 
(mod m) implies that r* = r 5 (mod m), by the cancellation law 
(Theorem 4-3). We may thus pair each ar t with some r 5 such that 
ar t = fj (mod m), and we note that r 5 is uniquely defined for each ar t . 
Note also that each r 5 is paired with some ar u since there are 4>(ra) 
of the fj and (f>(m) of the ar { . Thus 


r x r 2 . . . r Mm) ^ ar t ar 2 . . . ar Mm) (mod m). 


Hence, if R = r 1 r 2 . . . then 


R = a Mm) R (mod m). 

Now g.c.d. (R,m) = 1, because R is a product of integers each of which 
is relatively prime to m. Thus 

a Mm) = 1 (mod m), 


by the cancellation law. ® 

Corollary 5-2 (Fermat’s Little Theorem , Theorem 3-4): If 
p is a prime , then 


n p = n (mod p). 

Proof: If p | n, then n p =0 = n (mod p). If P~\~u 9 then 
g.c.d. (p,n) = 1. Thus, by Theorem 5-2, 

n p ~ 1 = 1 (mod p), 

since <Mp) = P " 1- Multiplying both sides of this congruence by n, 
we find that 


n p = n (mod p). B 

We now prove a form of Wilson’s theorem that is stronger than 
the one given in Chapter 3. 

Theorem 5-3: The congruence (m — 1)! = — 1 (mod m) holds 
if and only if m is a prime. 
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PROOF: First suppose m is a prime, and consider the m — 1 
positive integers 1,2, — 1. Corollary 5-1 confirms that if a is 

such an integer, then there exists an inverse a of a modulo m (a is 
unique if we require that 1 ^ a ^ m — 1) for which 

ad = 1 (mod m). 

It may happen that a is its own inverse; in other words, that 

a 2 = 1 (mod m ). 

In that case, m\ (a — 1) (a-h 1); since m is prime, we see by Corollary 
2-3 that either m \ a — 1 or m | a +1; therefore, a = itl (mod m). In 
the product (m— 2) (m— 3) ... 2, we pair each number with its in¬ 
verse modulo p. Thus, for m — 7, we write that 5 • 4 • 3 • 2 = (5 • 3) • (4 • 2). 
In general, we see that 

(m — 1)! ss (m — 1) -1.1 (mod m) 

s m — 1 = —1 (mod ra). 

Conversely, suppose that m is not a prime. Then there exists an 
a( 1 <a<m) such that a | m; note also that a \ (m — 1)!. If (m— 1)! = — 1 
(mod m), then there exists an integer k such that (m — 1)! + 1 = km . 
Since a \ m and a | (m—1)!, it follows from the last equation that 
a | 1; but this is an impossibility, because a > 1. Thus the congruence 
(m — 1)! s — 1 (mod m) cannot hold if m is not a prime. ■ 


EXERCISES 

1. If m= 13, then a reduced residue system modulo m is 
1,2,3,4,5,6,7,8,9,10,11,12. Letting a~ 3, exhibit the pair¬ 
ing of each of the preceding numbers with the numbers 
in the reduced residue system 3,6,9,12,15,18,21,24,27,30, 
33,36, as described in the proof of Theorem 5-2. 

2. If m— 11, then a reduced residue system modulo m is 
1,2,3,4,5,6,7,8,9,10. Exhibit the pairing of each of the 
preceding numbers with its inverse modulo m as de¬ 
scribed in the proof of Theorem 5-3. 

3. Prove that 1 + a + a 2 4- . . . + a 0<m)_1 = 0 (mod m) if 
g.c.d.(a,m) = 1 and g.c.d.(a — 1, m) = 1. 

4. Prove that if r u r 2 ,. . ., r 0{m) is a reduced residue system 
modulo m, and m is odd, then r x + r 2 + . . . + r 0(m) = 0 
(mod m). 
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5. What is the remainder when 41 75 is divided by 3? 

6. What is the remainder when 473 38 is divided by 5? 

7. Prove that if A = o 0 10" + aJO”" 1 + . . . + a„ and S = a 0 + 
Oj + . . . + a n , then A = S (mod 9). (This result is the basis 
for the computational check called “casting out nines”.) 

8. Suppose that p is a prime, and that p~{a; use Euler’s 
theorem to prove that x = a p ~ 2 b is a solution of 

ax = b (mod p). 

9. Prove that if p is a prime congruent to 1 modulo 4, then 

( - g X ) ! 2 = -1 (mod p). 

[Hint: Prove that ((p —l)/2)! 2 = (p — 1)! (mod p).] 

10. Use Exercise 9 to find a solution of each of the following 
congruences. 

(a) x 2 = — 1 (mod 13) 

(b) x 2 = — 1 (mod 17). 

11. Prove that for each odd prime p and for each a 
(0 — a — p — 1), 

a 1) ~ ^ moc ^ P)* 

12. Prove that for each prime p (n < p ^ 2n), 

( 2 ;) = 0 (mod p), 


but that 

(*;) + 0 (mod p 2 ). 

13. Is 2 P ~ 1 = 1 (mod p 2 ) for p=2? for p = 3? for p = 5? for 
p = 7? for p = 11? (After completing Exercise 13 you may 
be interested to learn that 2 1092 s 1 (mod 1093 2 ).) 
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14. For each m greater than 1, how many primes are there in 
the closed interval [m! + 2,ra! + m] ? 

15. Suppose p denotes a prime congruent to 3 modulo 4; use 
Wilson's theorem to prove that 

(^— 2 ~) ! 2 = 1 (mod p). 

[Hint: See the hint for Exercise 9.] 

16. Find the least positive integer n that satisfies each of the 
following congruences. 

(a) 3 56 = n (mod 7) 

(b) 7 38 = n (mod 11) 

(c) 7 128 = n (mod 13). 

17. Decide whether or not 17 is a prime by determining 
whether 16! is congruent to —1 modulo 17. Is this an effi¬ 
cient test for determining whether or not 1093 is a prime? 

18. Let A(m) be the least positive integer such that for each 
integer a relatively prime to m, 

a \(m) = i ( moc l 

Compute A(m) for each m (1 < m ^ 10). Is A(m) ^ $(m) 
for each m? Is A(m) < <f>(m) for some m? 

19. In 500 B.c. the Chinese seem to have known that 2 P = 2 
(mod p) for each prime p. They also assumed that if 
2 n = 2 (mod n), then n is prime. 

(a) Show that 341 is not a prime. 

(b) Show that 2 10 = 1 (mod 341). 

(c) Show that 2 341 = 2 (mod 341). 

20. Let <!> n = 2 2 ” -4-1. 

(a) Prove that <I> n | (2 2?l + 1 — 1). 

(b) Prove that if a \ fo, then (2 a — 1) | (2 6 — 1). 

(c) Prove that (2 2 " +1 - 1) | (2 22 ”-l). 
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(d) Prove that (2 22 "-l) | ( 2 22 ” +1 -2). 

(e) Prove that <t> n | (2— 2). 

(f) P. Fermat observed that =5, <$ 2 = 17, <J> 3 = 257, 
and 3> 4 = 65537 are all primes, and he suggested that 
<!> n is prime for each n. Discuss Fermat’s suggestion 
in the light of Exercises 19 and 20(e). 

21. Prove that if p denotes an odd prime, then 

= ±1 (modp). 

22. Prove that if p denotes an odd prime, then the product of 
all the odd integers less than p is congruent either to 1 or 
to —1 modulo p. [Hint: If p — 2Z + 1, then (p — 1)! = 
2 l l\ • 1 • 3 • 5 •... • (2/ — 1).] 

23. Prove that if p denotes an odd prime, then the product of 
all the even integers less than p is congruent either to 1 
or to —1 modulo p. 


5-3 THE CHINESE REMAINDER THEOREM 

Having considered single linear congruences in Section 5-1, we 
now turn to the problem of finding solutions of systems of linear con¬ 
gruences. A solution of the system of congruences 

a x x = b x (mod mj, 
a 2 x s b 2 (mod ra 2 ), 


and 


a s x = b s (mod m s ) 


is an integer that satisfies each congruence in the system. 

The simplest examples of such systems arise in the solution of 
single linear congruences with large moduli. Let m have the prime 
factorization 

m = p! ei p 2 e2 . * • Ps €s ; 


then, as a consequence of the fundamental theorem of arithmetic, 
m | n if and only if pf * | n for each i. Hence, 
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A = B (mod m) 

if and only if all of the congruences 

A = B (mod Pi 1 ), 
A = B (mod p 2 e2 ). 


and 


A = B (mod Ps s ) 


hold. It follows that the congruence 

ax = b (mod m) (5-3-1) 

has the same set of solutions as the system of simultaneous con¬ 
gruences 


and 


ax = b (mod p t €l ), 
ax = b (mod p 2 e2 ), 

(5-3-2) 


ax 35 b (mod p/ s ). 

Although there are several congruences to solve in (5-3-2), 
their moduli are generally much smaller than m, and, as we shall see, 
computations are thus simplified. 

Example 5-6; Let us replace the congruence 

3x = 11 (mod 2275) 

by a system of linear congruences with smaller moduli. Since 
2275 = 5 2 • 7 • 13, our congruence may be replaced by the system 


3x = 11 (mod 25), 

(5-3-3) 

3x = 11 (mod 7), 

(5-3-4) 

3x m 11 (mod 13), 

(5-3-5) 


and 
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The basic existence theorem for systems of congruences is the 
Chinese Remainder Theorem, which has this name because Sun-Tsu 
is believed to be the first mathematician who studied special cases 
of this theorem. The proof we shall give furnishes us with a method 
for constructing solutions of a system of congruences from solutions 
of the individual congruences in the system. 

Theorem 5-4: Suppose m l9 m 2 , .. ., m s are s integers , no two 
of which have a common factor other than 1. Let M — m 1 m 2 . . . m s , 
and suppose that a u a 2 ,. . ., a s are integers such that g.c.d. (a* ,m f ) = 1 
for each i. Then the s congruences 

a t x = b i (mod m x ), 
a 2 x = b 2 (mod m 2 ), 


and 

a s x = b s (mod m s ) 

have a simultaneous solution that is unique modulo M. 

PROOF: From the solutions of each particular congruence, we 
shall construct one common to the entire set. First we choose in¬ 
tegers c l9 c 2 , . . ., c s such that 

a % Ci = bf (mod m*) 

(such integers c t exist by Theorem 5-1). Now let n { — M/m Since no 
two of the m t have a common factor, we see that g.c.d. (n*, m { ) = 1. 
Hence, by Corollary 5-1, there exists an n* such that = 1 (mod 

We now show that the number x 0 defined by 

x 0 = c x n x h x + c 2 n 2 h 2 + . . . + c s n s h s 

is a solution of the original system of s congruences. First note that 
mi divides each Uj except for n*. Thus 

a { x 0 = a i c 1 n 1 fi 1 + a { c 2 n 2 h 2 + . . . + aiC s n s n s 
= aiCiUihi (mod m { ) 

= afCi (mod m t ) 

= b t (mod mi). 


Hence, x 0 is a solution of each congruence. 
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If y is a solution of the system of s congruences, then, by Theorem 
5-1, Xo = Ci s y (mod rrii). Hence m { \ x 0 — y for each m t ; since no two 
of the mi have a common factor, m t m 2 . . . m s \ x 0 — y, that is, 
M | x 0 — y. Thus y = x 0 (mod M). 

Example 5-7: We now use the Chinese Remainder Theorem to 
solve the problem posed in the preface to this chapter; namely, what 
is the least positive x such that 

x = 2 (mod 3), 
x = 3 (mod 5), 

and 

x = 2 (mod 7)? 

In terms of Theorem 5-4, a 1 = a 2 — a 3 = 1; c x — 2, c 2 — 3, and 
c 3 = 2; m x = 3, m 2 = 5, and m 3 = 7; M = 105; nj = 35, n 2 = 21, and 
n 3 = 15. Now 35ni = 1 (mod 3), that is, 2 n x = 1 (mod 3); thus we may 
let n x = 2. Next, 21n 2 — 1 (mod 5), and so we may let n 2 = 1. Finally, 
15n 3 = 1 (mod 7), and we may let n 3 = 1. Consequendy, a solution 
of the system is 


x 0 = 2 • 35 • 2 + 3 • 21 • 1 + 2 • 15 • 1 
= 140 + 63 + 30 = 233. 

We conclude that all solutions are congruent to 233 (mod 105), 
and that the least positive solution is 23. 

Example 5-8: Let us now solve the congruence considered in 
Example 5-6. Since g.c.d.(3,2275) = 1, the congruence has a solu¬ 
tion. We note that x = 12 is a solution of (5-3-3); x = 6, a solution 
of (5-3-4); and x = 8, a solution of (5-3-5). (These particular solutions 
can easily be obtained by inspection.) Using the notation of Theorem 
5-4, we see that 


and 


c x = 12, 
c 2 = 6, 

c 3 = 8. 


Next we must find n l9 n 2 , and n 3 such that 


^ n t = 9ln x ss 1 (mod 25), 


25 
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n 2 = 325n 2 = 1 (mod 7), 

and 

n 3 = 175n 3 a 1 (mod 13). 


Equivalently, we must solve the simpler congruences 


and 


16n x = 1 (mod 25), 
3n 2 s 1 (mod 7), 

6n 3 = 1 (mod 13). 


By inspection, we see that n x = 11, n 2 — 5, and n 3 = 11 are solutions. 

We now use the foregoing results to obtain a solution of the 
original congruence 

3x s 11 (mod 2275). 


It is 

*o = 12 • 91 • 11 + 6 • 325 • 5 + 8 • 175 • 11 
= 12012 4- 9750 + 15400 
= 37162 - 762 (mod 2275). 

Hence we see that 762 is the smallest possible solution of the 
congruence. 

In summary, we observe that to arrive at this answer we had to 
solve six congruences; however, the moduli involved were small, 
so that each of the six congruences was easy to solve by inspection. 


EXERCISES 


1. Find all solutions of each of the following systems of 
congruences: 


(a) x = 1 (mod 2) 
x s 2 (mod 3) 
x = 3 (mod 5) 

(b) x = 1 (mod 3) 
x = 3 (mod 5) 
x = 5 (mod 7) 


(c) 3x^1 (mod 5) 
4x = 6 (mod 14) 
5x = 11 (mod 3) 

(d) 4x = 2 (mod 6) 
3x = 5 (mod 7) 
2x = 4 (mod 11) 
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(e) x = 1 (mod 3) 
x = 2 (mod 4) 
x = 3 (mod 7) 
x = 4 (mod 11) 


(f) 2x^1 (mod 5) 
3x = 9 (mod 6) 
4x = 1 (mod 7) 
5x = 9 (mod 11), 


2. Find the least positive integer that yields the remainders 
1, 3, and 5 when divided by 5, 7, and 9, respectively. 


3. In Exercise 2 of Section 8-1, it is asserted that if 3> rt = 
2 2 ”+ 1, then g.c.d.(O n , 3> m ) = 1 whenever n ¥* m. Using 
this assertion, prove that for each n there exist n consecu¬ 
tive integers N, N+l,...,N+n — 1 such that the first is 
divisible by <£*; the second, by 0 2 ; the third, by <J> 3 ; . . .; 
and the nth, by <I> n . When n = 2, what is the least possible 
value of N? When n = 3? 


4. Prove that for each n there exists a sequence of n consecu¬ 
tive integers each of which is divisible by a perfect square 
exceeding 1. 

5. Find three consecutive integers the first of which is 
divisible by the square of a prime; the second, by the cube 
of a prime; and the third, by the fourth power of a prime. 

6. A class in number theory was to divide itself into groups of 
equal sizes to study the Chinese Remainder Theorem. 
When the class was divided into groups of 3, two students 
were left out; when into groups of 4, one was left out. When 
it was divided into groups of five, the students found that if 
the professor was added to one of the groups, no one was 
left out. Since the professor had never really understood 
the Chinese Remainder Theorem when he was in college, 
the last arrangement worked out nicely. How many stu¬ 
dents were there in the class? 


5-4 POLYNOMIAL CONGRUENCES 

After a study of linear congruences, it is natural to examine con¬ 
gruences of the form 


where 


f(x) = 0 (mod m), 


/(x) = a 0 x n + diX 71 " 1 H- . . . + a 
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( a 0 7 *~ 0; all the a { are integers). The function/(x) is called a. poly¬ 
nomial of degree n with integral coefficients. 

We shall not delve extensively into polynomial congruences; 
however, the subject contains an important theorem of Lagrange that 
we shall need when we study primitive roots. 

Theorem 5-5: Iff(x) is a polynomial of degree n with integral 
coefficients (that is, iff(x)~a 0 x n -\-a 1 x n ~ 1 ~b. . . + a„), and if p is a 
prime such that p“|~ a 0 , then the congruence 

f(x) = 0 (mod p) 

has at most n mutually incongruent solutions modulo p. 

PROOF: We proceed by mathematical induction on the degree n. 
If n = 0, then f(x) = a 0 . Since a 0 #0 (mod p), there are no solutions. 
If n = 1, then f(x) = a 0 x + a x ; and we know by Theorem 5-1 that the 
congruence 


a 0 x = —a x (mod p) 

has exactly one solution (mod p), because g.c.d.(a 0 ,p) = 1. 

Assume now that we have proved the theorem for all polynomials 
of degreen less than k , where k ^2. Suppose that f(x) = a 0 x k + .. . + a k 
is a polynomial of degree k with k + 1 mutually incongruent solutions 
modulo p, say w l9 w 2 , . . ., w k + 1 . We define a polynomial 

g(x) =f(x) — a 0 (x — w x ) (x — tv 2 ) . . . (x — w k ). 

Now g (x) — f(x) = 0 (mod p) if x = w l9 w 2 ,..., w k . Furthermore, 
g(x) is a polynomial of degree less than fc, because the leading term 
a 0 x k of f(x) is cancelled by the term a Q x k in the product on the right. 
Thus, since the number of solutions of the congruence g (x) = 0 
(mod p) is larger than the degree of g (x), it follows from the induction 
hypothesis that 


g (x) =0 (mod p) 
for each integer x. In particular, 

g(w k + x ) = 0 (mod p). 


Hence 
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0 = g(w k + 1 ) (mod p) 

— f(w>k + i) ~ao(w k+1 — Wi) (w k + 1 ~ w 2 ) . . . (w k+1 -w k ) (mod p) 

= -«o(u; fe + 1 -u; 1 )(u; fc + 1 -tt; 2 ) . . . (w k + 1 -w k ) (mod p). 

Thus p | a 0 (tt>fc + i — w t ){w k + 1 — w 2 ) ... (w k + l — w k ), so that, by 
Corollary 2-4, p divides one of the factors. However, p/a 0 > and since 
the Wf are mutually incongruent, the difference between any two 
is not divisible by p. This contradiction leads us to conclude that the 
congruence f(x) = 0 (mod p) has at most k mutually incongruent 
solutions, and our theorem is established. ■ 


EXERCISES 

1. Find all solutions of each of the following congruences. 

(a) x 2 + x + 1 =0 (mod 11) 

(b) x 3 + x + 1 =0 (mod 13) 

(c) x 4 + x 3 + 2 = 0 (mod 7). 

2. Prove that if f(x) = a 0 x n + a x x n ~ x 4-. . . + a n and m is an 
integer, then k \ |/ (fc) (m), where f (k) is the kth derivative 
of/ 

3. From calculus recall that the Taylor series for the poly¬ 
nomial 


f(x) = a 0 x n + ajx” 1 + . . . + a n 

f(x + h) =/(*) +/'(x)fc+£§p+ . .. + f ln) M h \ 

Prove that if a 0 , a l9 . . ., a n , m, r, are integers, and p is a 
prime, then 

f(m + rp s ) = f(m) -j- rp s f' (m) (modp s + 1 )* 

4. Suppose that f(m) = 0 (mod p s ) and that p1~/'(m), where 
p is a prime. Use Exercise 3 to prove that there exists an 
r (unique modulo p) such that 


f(m + rp s ) =0(mod p s+1 ). 
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5. Use Exercise 4 to determine the number of solutions of the 
congruence 

x 7 + x 4-1 = 0 (mod 343). 

6. A polynomial is said to be monic if its leading coefficient is 
1. A monic polynomial/(x) of degree n is called reducible 
modulo p if there exist nonconstant, monic polynomials 
g(x) and h(x ), each of degree less than n, such that 

f(x) = g(x)h(x) (mod p). 

Is x 4 -fl reducible modulo 2? modulo 3? modulo 5? 
modulo 7? modulo 11? modulo 13? Contrast your answers 
with the fact that x 4 + 1 is irreducible in the sense of ele¬ 
mentary algebra. 

7. Use Eulers theorem and Theorem 5-5 to prove that 

x p ~* — 1 = (x - l)(x-2) . . . (x-p + 1) (mod p) 
for each integer x and each prime p. 

8. What may one deduce from the result given in Exercise 7 
by setting x = p? 




CHAPTER 6 


ARITHMETIC FUNCTIONS 


Any function whose domain is some set of integers is called an 
arithmetic function. In Chapters 4 and 5, we encountered <f>(n ), the 
Euler ^-function. Other arithmetic functions we shall meet are the 
number d(n) of divisors of n, and the sum o-(n) of the divisors of n. 

We shall use combinatorial techniques (see Chapter 3) to establish 
multiplicative and other properties of <f>(n ), d{n ), and o*(n). Then, 
we shall generalize our results by means of the Mobius function p(n). 

6-1 COMBINATORIAL STUDY OF 

In Chapter 4, we met the Euler ^-function <£(n) in connection 
with reduced residue systems modulo n. Recall that <f>(n) is defined 
as the number of positive integers not exceeding n that are relatively 
prime to n. 

Example 6-1: In Example 4-6, we showed that </>(p) = p — 1 
for any prime p. It is almost as easy to determine <t>(p n ) for each 
positive integer n and each prime p. We proceed by counting the 
positive integers not exceeding p n that are not relatively prime to p n . 
Clearly, an integer m has a factor greater than 1 in common with p” if 
and only if p | ra. Hence, the positive integers less than or equal to 
p n that are not relatively prime to p n are the p n_1 multiples of 
p: p, 2p, 3p, . . p n ~ 1 * p. Therefore, there are p n — p n_1 positive 
integers not exceeding p n that are relatively prime to p n . That is, 

*(p*) = p"-p"-i = p" (i“). 

For integers other than prime powers, the values of the ^-function 
are seemingly irregularly distributed (see Table 6-1); nevertheless, 
in Theorem 6-2 we shall be able to obtain an explicit formula for <j> (n) 
by generalizing the combinatorial procedure of Example 6-1. 
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Example 6-2: From Example 6-1, we see that 
1 + 4 >(p) + 0(p 2 ) + • • ■ + 4>(p n ) = 1 + (p — 1) + (p 2 — p) + . . . 

+ {p n -p n ~ 1 ) = p n , 

since the terms in the penultimate expression mutually cancel each 
other save for p n . 

In Theorem 6-1, we shall establish an important formula involv¬ 
ing the Euler 0-function that generalizes Example 6-2. 

Theorem 6—1: 0(d) = n. 

din 

Notation: ^ 4>(d) denotes the sum of the values of the 

d | n 

0-function taken for all the divisors of n. For example, when n = 6, 
2 0(d) = 0(1) +0(2) + 0(3) +0(6) = 1 + 1 + 2 + 2 = 6. 

die 

PROOF: We use a combinatorial argument. Let S n denote the set 
{1,2,.. .,n}. If we let #(S n ) denote the number of elements of S n , 
then clearly 

#(S n ) = n. 


For each d that divides n, we denote by T d (n) the set of positive 
integers not exceeding n whose greatest common divisor with n is d. 
Taking n = 6, for example, we see that T x { 6)= {1,5}, T 2 (6) — {2,4}, 
T 3 (6) = {3}, and T e (6) = {6}. Clearly, for each n the various T d (n) 
have no common elements. Furthermore, for any m E S n , we see that 
m E T d (n) where d = g.c.d. (m,n). Consequently, 

n = #(S n ) = ^ #{T d (n)}. 

din 


We now show that T d (n) has 0(n/d) elements. First note that all 
elements of T d (n) are multiples of d, and are less than or equal to n. 
Thus the elements of T d (n) are found among the numbers d, 2d, . . ., 
(n/d) d. Now if g.c.d.(a,n/d) = e, then clearly g.c.d.(ad,n) = ed 9 and 
ed = d if and only if e = 1. Therefore, the only numbers ad in T d (n) 
are those for which g.c.d.(a,n/d) = 1, there being 0(n/d) such 
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Table 6-1: Values of <f>(n), d(n), and <r(n)(l n =£20). 


n 

<M«) 

d(n) 

<r(n) 

1 

1 

1 

1 

2 

1 

2 

3 

3 

2 

2 

4 

4 

2 

3 

7 

5 

4 

2 

6 

6 

2 

4 

12 

7 

6 

2 

8 

8 

4 

4 

15 

9 

6 

3 

13 

10 

4 

4 

18 

11 

10 

2 

12 

12 

6 

6 

28 

13 

12 

2 

14 

14 

4 

4 

24 

15 

8 

4 

24 

16 

8 

5 

31 

17 

16 

2 

18 

18 

6 

6 

39 

19 

18 

2 

20 

20 

8 

6 

42 


numbers, by definition of the (^-function. Hence 

n = #(S„) = 2 #{T d (n)} = 2 <t>(n/d). 

din d\n 


We note that 


2 <Hnld)='2 <Md), 

din din 

for as d assumes the values of the various divisors of n, so does the 
complementary divisor n/d. Thus we have proved our theorem. ■ 

We now define the Mobius function p(n), which will be useful 
in the next theorem. 

Definition 6-1: 

f 1 if n == 1, 

p{n) = | 0 if p 2 | n for some prime p, 

[ (—1) r if n = V\Vi • • • p r where the p x are distinct primes. 

Example 6-3; /x(2) = —1, p(3) = —1, p(4) = 0, 5) = —1, and 

p(6) = 1. 
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The first part of the following theorem has formal similarities to 
Theorem 6-1. 


Theorem 6-2: = p{d) 4= n y[ (l——)• 

d\n a Pin ' V' 

NOTATION: (i denotes the product of all numbers of 

the form ^1 — — j with p taking as values die distinct prime divisors 

n 

of n. Sometimes we may use the symbol Y\ A*?; this denotes the prod¬ 
uct AjA 2 . . . A n . 1=1 


PROOF: We proceed by mathematical induction on the number 
of prime factors of n. If n has one prime factor, say n = q a , then, by 
Example 6-1, 

<Hn) = 4>(q") = q*-q*- 1 . 


On the other hand, substitution of the definition of n gives us two 
equalities: 


£ /i(d)4=At( l)q« + v{q)q a ~ l + n{q*)q a ~ 2 + . . . + 

d I n 

= q a — q + 0 + . . . + 0 
= q a — q a ~\ 

and 

Consequently, when n has one prime factor, all three of the ex¬ 
pressions in the statement of Theorem 6-2 have the same value. 

Now let us assume that the theorem is true for each integer with 
k or fewer distinct prime factors. Suppose n = n'p**, where n' has k 
distinct prime factors and p is a prime that does not divide n f . Then, 
by the induction hypothesis, 

*(»'>=2 n (i-i)- 

din U Pin' X 

Let us now divide the set {1,2,.. .,n} into p a subsets, each con¬ 
sisting of n' consecutive integers. Each subset contains ) in¬ 
tegers that are relatively prime to n’. Now, of the p a <f>{n') positive 
integers in {1,2,.. ,,n} that are relatively prime to n', the only ones 
having a common factor with n are the p“ -1 $(n') integers that are 
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multiples of p. Hence, 

4 >(n) = p a </>(n') — p" _1 </>(n'). (6-1-1) 

For example, let n = 30 = 2 -3 -5 and n' =6 = 2 *3; below, we have 
underlined the 5 • </>(6) numbers in {1,2,.. .,30} that are relatively 
prime to 6. 

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 
19 20 21 22 23 24 25 26 27 28 29 30 


To obtain the 5 1_1 </>(6) = </>(6) numbers in {1,2, ...,30} that are 
relatively prime to 6 but not to 5, we take all of the numbers less than 
or equal to 30/5 = 6 that are relatively prime to 6, and multiply each 
by 5. Thus we obtain 5*1=5 and 5*5 = 25. The exclusion of these 
numbers from those underlined above leaves 1, 7, 11, 13, 17, 19, 23, 
and 29; that is, all numbers in {1,2,... ,30} that are relatively prime 
to 30. 

Hence, by (6-1-1), 

</>(n) = p a 4>(n') — p a_1 </>(n') 


[here we have used the obvi¬ 
ous fact that if then 

P'(pd) = 


Pt d Ptd 


2 4 2 

din' u V din' 


din U din' 


d 

pfd 


2 ^+2.^)^ 


dl n 
pfd 


pd I n 
Ptd 




d I n 


pdl n 


+ 2 n(p*d)-p2 + 

P 2 d I n P a 

Ptd 


2 

p a d In " w 


Ptd 


= 2>(d)§. 

din a 





80 


ARITHMETIC FUNCTIONS 


The penultimate expression may give the appearance of arising from 
nowhere. Its first two terms are identical with the preceding line; the 
new terms are equal to zero, since p(N) = 0 if q 2 1 N for some prime q, 
and since, in the new terms, p 2 (or a higher power of p) appears in 
the argument of p. 

Again from (6-1-1), we see that 

</>(n) = p a <j)(n f ) — p a-1 </>(n') 

= P“0(»') (l“) 

=p “"'( 1 ~p)n( 1 -|) 

— n (!-£)• 

Pin x H/ 

Thus our theorem is established. ® 

Let us derive a new formula for <£(n) from the second expression 
in the statement of Theorem 6-2. Suppose that 

n = Pi ei p 2 ez • • • P/ S , 
where the p* are distinct primes; then 

<Hn)=n n (l") 

Pin ' p/ 


= Pi'Pi 2 • • • Ps* n 

( 6 - 1 - 2 ) 

=n 

i= 1 


= fl ^>(Pi ei ) = </>(Pi ei )0(p2 e2 ) 

i= 1 

Thus 0(h) is the product of the values assigned to the 0-function for 
the prime powers in the factorization of n. This provides an easy way 
to remember the formula for 0(n). 

Example 6-4: 0(120) = 0(2® -3-5) 

= 0(2®) 0(3) 0(5) 
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= (2 3 — 2 2 ) • (3-1) • (5-1) 

=4-2-4 
= 32. 

EXERCISES (* means difficult) 

1. Prove that if $(m) | m — 1, then there exists no prime p 
such that p 2 | m. 

*2. Prove that if m is not a prime and <£(m) | m — 1, then m 
has at least three distinct prime factors. 

*3. Prove that if m is not a prime and \ m — 1, then m 

has at least four distinct prime factors. 

4. Prove that <f>(m) is even if to > 2. 

5. Prove that if n has r distinct prime factors, then 
<f>(n) ^ n2~ r . 

6. Find all integers n such that <j>(n ) = 12. 

7. C. Goldbach conjectured that every even number greater 
than 2 is a sum of two primes. P. Erdos conjectured that, 
for any even number 2n, there exist integers q and r 
such that 

<t>(q) + </>(r) = 2n. 

Does the conjecture of Goldbach imply that of Erdos? 

8. Suppose g.c.d. (m,n) = 1. Prove that if x assumes all 
values in a reduced residue system modulo m, and y 
assumes all values in a reduced residue system modulo n, 
then nx + my assumes all values in a reduced residue 
system modulo mn. 

9. Use Exercise 8 to prove that if g.c.d. (m,n) = 1, then 
( j>(mn ) = </>(m)</>(n). 

10. Use Exercise 9 to give a new proof that 

0(n) = ».n 

Pin x 
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11. Characterize all pairs of integers rrt and n for which 
<£(mn) = <£(n). 

12. R. D. Carmichael conjectured that for each integer n there 
exists an m different from n such that 

<£(n) = <£(m). 

(a) Prove Carmichael's conjecture for each n congruent 
to 2 modulo 4. 

(b) Prove Carmichael's conjecture for each n less than 20. 

13. Find infinitely many integers n for which 10|<£(n). 
[Hint: <£(11) = 10.] 

14. Prove that there are infinitely many integers n for which 
<£(n) is a perfect square. 

15. Evaluate <£(19), <£(49), <£(243), and <£(1024). 

6-2 FORMULAE FOR d(n) AND cr(n) 

We now turn our attention to the number d(n) of (positive) 
divisors of n, and to the sum cr(n) of these divisors. As is the case with 
<£(n), these functions are easily evaluated when n is a prime power. 

Example 6-5; If p denotes a prime, then the positive divisors of 
p are 1 and p. Thus d(p) = 2, and <r(p) - p + 1. More generally, the 
positive divisors of p n are 1, p, p 2 , . .. ,p n . Hence d(p n ) = n4* 1, and 

o-(p n )=l + p4-p 2 +. . . + P n 

^n+l _ 1 

_ P --— (by Theorem 1-2). 

P~ 1 

The following theorem establishes general formulae for d(n) 
and cr(n). 

Theorem 6-3; If n = p? x . . . p s as , then 

d(n) = (ai 4* 1) (a 2 -hi)... (a s 4- 1), 


o-(n) = 


P i“l 


1 

P 2 “l 


p « s +i _l 


and 
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Proof: As before, we proceed by mathematical induction on 
the number of prime factors of n. If n has one prime factor, say 
n — p a , then 

d(n) = d(p a ) = a + 1, 

and 

— 1 

<r(n) = <r(v a ) = - — i , 

by Example 6-5. 


Now assume that the theorem is true whenever n has k or fewer 
distinct prime factors. Let n — n’p a where n' has k distinct prime 
factors and p, which is a prime, is not a factor of n'. If 

n ' = Pi ai p 2 a2 • - • Vk k , 

then 

d(n') = («! -f l)(a 2 + I) . . . (a* + 1) 

and 


„(„>) = P“ 1 + 1 -~ 1 Ps a2+1 -1 Pfc“ fc + l -l 
( ^ Pi-1 P.-1 p*-l • 


Let dj, <i 2 , • • d s denote the d(n’) divisors of n'. Then the 
divisors of n are d u d 2 , ..d s , pd u pd 2 , ..., pd s , p 2 d x , p 2 d 2 , .. 
p 2 d s , . . p a d 1 , p a d 2 , . . ., p a d s . Thus 

d(n) = d(n’)(a + 1) 

= («! + 1) (a 2 + 1) . . . ( a k + 1) (a + 1), 

and similarly, 

<r(n) = cr(n') + pcr(n') + . . . + p“cr(n') 

= o-(n')(l + p + ... + p tt ) 



Thus, our theorem follows by mathematical induction. ■ 


Corollary 6-1: If n = p x ai p 2 a *. .. p s “», then 
d(n) = d{p x ai )d{p 2 a *) . . . d(p s a *) 
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cr(n) = o-{pi ai )(r(.pf*) ■ ■ ■ cr(p s “»). 

Example 6-6: d (120) = d (2 3 • 3 • 5) = d (2 3 ) d (3) d (5) = 4 • 2 • 2 = 
16, and o-(120) = • f£y= 15 • 4 -6 = 360. 

EXERCISES (° means difficult) 

1. Prove that d(n) is odd if and only if n is a perfect square. 

2. Prove that d = n din)li . 

d I n 

3. Prove that if n = Pt ai p 2 a2 - • • Pr“ r , then 

(r(n)(/)(n) = n 2 (l - Pf ai_1 ) • • • (1 — Pr “ r 1 ) * 
and 

* («) * (n) > n2 (i _ f?) (i - ??) • • • (i" h) ■ 

4. Prove that the number of ways of writing n as a sum of 
consecutive integers equals d(m) where m is the largest 
odd number dividing n. (For example, if n = 15, then 
15, 7 + 8, 4 + 5 + 6, and 1 + 2+ 3+ 4+ 5 are the ways 
of writing 15 as a sum of consecutive integers, and there¬ 
fore cZ (15) = 4.) 

5. Prove that cr(n) = d(m ) (mod 2) where m is the largest 
odd factor of n. 


*6. Prove that if n = p, ai p 2 “ 2 • • • Pr° r , then 


2 d<Hd) = 

d I n 


1 + 1 + 1 

Pi +1 


p2 a r +l -j- l 

Pr+ 1 


*7. Prove that if n = Pi 0l p 2 “ 2 • • • Vr r , then 

2 ed(e) 

el n 

-n {(oj+i)p“ i+2_ (flj+2)pj°i +i + i }(pj- i ) _2 - 
i= 1 

8. Prove that d(n) <2 Vn. [Hint: Prove that each factoriza¬ 
tion of n has one factor that does not exceed Vn.] 
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9. If <r(n) = 2n, n is a perfect number. Prove that if n is a 
perfect number, then -4 = 2. 

din “ 


10. Evaluate cr(210), <#>(100), and <r(999). 


11. Evaluate d(47), d( 63), and d(150). 


12. Prove that + l j s an integer if n is a prime, and 

n 

that it is not an integer if n is divisible by the square 
of a prime. 


13. Let S(n ) denote the number of positive integers not ex¬ 
ceeding n that are square-free (that is, not divisible by the 

square of a prime). Compare S (n) and ~ for 

n ^ 20. Can you conjecture which is always the larger? 


14. Show that Goldbach’s conjecture (see Exercise 7 of 
Section 6-1) implies that for each even number 2m there 
exist integers m 1 and m 2 such that 

a(m 1 ) + cr(m 2 ) ~ 2m. 


15. Prove that for each integer n, there exist integers n x and 
n 2 such that 


d{n x ) + d(n 2 ) = n. 


6-3 MULTIPLICATIVE ARITHMETIC FUNCTIONS 

The four functions we have considered in this chapter belong to 
the class of multiplicative arithmetic functions. 

Definition 6-2: An arithmetic function f{n) is said to be 

multiplicative if 


f(mn) =f(m)f(n) 
whenever g.c.d. (m,n) = 1. 

Theorem 6-4: </>(n), d(n), o-(n), and fi{n) are multiplicative 

arithmetic functions . 
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Proof: Suppose g.c.d. (m,n) = 1. If 

n = pf 1 . . . p r n '' and m — qf 1 . . . q/* 

are the prime factorizations of n and m, then no )), can occur among 
the q t . Hence, by equation (6-1-2), 

<j)(nm) = <t>{Pi a ')4>(P2 a2 ) • • • <t>(Pr ar )<t>(Qi 3l )<t>(q 2 <2 ) ■ ■ ■ 4>(Qs es ) 
= <f>(n)<f>(m). 

By Corollary 6-1, 

d(nm) = dipf^diPi**) .. . d(p r ar )d(q 1 l3l )d{q 2 lii ) ■ ■ ■ diqj**) 

= d(n)d(m). 

Also by Corollary 6-1, 


a-(nm) = cr(p 1 “ , )o-(p 2 0 ' 2 ) • • • cr{p r ar )a-{q^)<r{q 2 ^) .. . o-(q/») 

= <x(n) cr(m). 

Finally we note that p,(mn ) = 0 = p.(m) p-(n) if any of the exponents 
exceeds 1; if all the a t and y3 ( equal 1, then p(mn) = (—l) r+s = 
p,(n)p.(m). ■ 

EXERCISE 

1. Suppose/(n) and g(n) are multiplicative and that/(p r ) = 
g(p r ) for each r and each prime p. Prove that/(n) =g(n) 
for all n. 

6-4 THE MOBIUS INVERSION FORMULA 

In Section 6-2, we saw two similar formulae related to 4>(n), 
namely 

n = ^ 4>(d) and <£(n) = p{d) 

d I n d I n ' 

These two formulae represent a special case of a general theorem on 
the Mobius function. However, before we can prove this general 
theorem, we must prove a special result about the Mobius function. 

Theorem 6-5: 2 ^ = {o tf n > 1. 

din k J 
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PROOF: The assertion is clearly true if n — 1. We proceed by 
mathematical induction on the number of different prime factors of n 
when n > 1. 

First, if n = p a , then 

2 p(d) = Ml) + Mp) + Mp 2 ) + • • • + Mp“) 

d I n 

= 1 - 1+0 + . . .+0 
= 0 . 

Suppose the theorem is true for integers with at most k prime 
factors. Assuming that n=n'p a , where n' has k distinct prime factors 
and p is a prime that does not divide n', we have the equation 

2 nid) = 2 Md) + 2 p(P d > + X ^ 2 d) +... + 2 n(P a d) 

d\n d I n r d\n' din' d\n' 

= ^ p.{d) -^/t(rf)+0 + ...+0 

d\n’ d\n' 

= o. ■ 

Theorem 6-6 (Mobius Inversion Formula): If two arithmetic 
functions f(n) and g(n) satisfy one of the two conditions 

/(»)=2 g(d) 

d I n 

and 

«(n)=2 mW)/(3) 

din ^ U// 

/or each n, fhen they satisfy both conditions. 

Proof: First suppose that 

/(n)=2 *(<*); 

d I n 


din ' a/ dd' = n 

= X X 

dr/' = n p\ d' 


then 
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= 2 ti(d)g(e) 

deh = n 

= X s(e) x 

eh' = n d \ h' 


By Theorem 6-5, the sum ^ fJi(d) has the value 0 if h f > 1, and the 

d\h' 

value 1 if h' = 1. Hence 

2 »(d)f(^) = g(n). 

d\n X 7 

Conversely, suppose g(n) = ^ p(d)f(^j . Then 


2 «(d) = E 

din din 


2 <*«*’>/(#) 

d'ld V 7 


= 2 M(^')/(e) 

d'ef= n 

= X X 

eh'~n d'lft' 


As before, Theorem 6-5 implies that the sum ^ M-(d') has the 

d'\h' 

value 0 if h' > 1, and the value 1 if h f — 1. Hence 


^ g{d) =/(n). 

d I n 


Definition 6-3: Iff(n) andg(n) are two arithmetic functions 
satisfying the condition f(n) = X S(d), then we say that {/(n),g(n)} 

dl n 

is a Mobius pair. 


We shall now relate Mobius pairs to multiplicative functions. 

Theorem 6-7: If one of the functions in the Mobius pair 
{/(n),g(n)} is multiplicative, so is the other. 

PROOF: Suppose g(n) is multiplicative, and assume g.c.d.(m,n) — 
1. Then 


f(mn) = ^ g(d). 

d I mn 


Since m and n are relatively prime, we may separate each divisor 
d of mn into unique factors e and h such that e is a divisor of m, and h 
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is a divisor of n; note also that g.c.d. (e,h) = 1. Hence 
f(mn) = X 2 

el m h\n 

= 2 S(e) 2 g(h) 

elm h\n 

— f(m)f(n), 

and /(n) is also a multiplicative function. 

Now suppose f{n) is multiplicative. Then, by Theorem 6-6, we 
know that 

g(mn) = 2 

Hence, following an argument similar to that in the preceding para¬ 
graph, we see that 

g(mn) = H 

e\m h\n ' ' 

"2 

- 2 pwf(fj 2 

e|m N/ /iln x/ 

= g(m)g(n), 

hence g(n) is multiplicative. ■ 

We now study the functions 4 >{n ), d(n), and cr(n) by means of 
Mobius pairs. 

Theorem 6-8: {n,<£(n)}, {d(n), 1}, and {o-(n),n} are aZZ 

Mobius pairs . 

PROOF: Theorem 6-1 asserts that 

n = ]T c/>(d), 

din 

and by definition 

d(n)=^ 1 and <x(n) = ^ d. 

din din 
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Thus Theorem 6-8 now follows from Definition 6-3. ■ 


Noting that the functions /i(n) = 1 and/ 2 (n) — n are multiplica¬ 
tive, we deduce from Theorems 6-7 and 6-8 that </>(n), d(n) 9 and 
cr(n) are also multiplicative (an alternative proof of Theorem 6-4). 


EXERCISES (* means difficult) 

1. Prove that if cr k (n) — ^ d k , then cr k (n) is multiplicative. 

din 

2. Prove that if f{n) is multiplicative, then 


2 fi(d)f(d )=n {i -/(?)}* 

din Pin 

3. Use Exercise 2 to give a new proof of the second part of 
Theorem 6-2; namely, that 


5>w>3—n(i-J)- 

din p I n N ^' 


4. Let S (n) denote the number of square-free integers (see 
Exercise 13 of Section 6-2) not exceeding n. Prove that 

S(n) =2 I P(J) I- 

j=i 

*5. With the definition of S (n) given in Exercise 4, prove that 
S(n) =2 2 m(^)- [Hint: Use Theorem 6-5.] 

3=1 d 2 1 j 

*6. Use Exercise 5 to prove that 

s(»)=i md) tel, 

d = 1 L J 

where te] denotes the largest integer that does not ex- 
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7. Prove that xTT = X P- 2 (d)l<j>(d ). 

din 

8. Prove that ^ p(d) (f>{d) = (2 —p). 

din Pin 

9. Let A (d,e) be a function defined for all integers d and e. 
Prove that 


2 £A(d,e). 

din e|§ eln dl| 


10. Prove that if n = p 1 ai p 2 a2 • • • Pr° r > then 


2 




11. Prove that if 

/(n> =n 

din 

then 

g(n)=n/W)"®- 

d I n 

[Hint: Use logarithms.] 

12. Use Exercise 2 of Section 6-2 and Exercise 11 above to 
prove that 


n = n h iWl1 ^)/ 2 ■ 

h I n 


13. Prove that if 


then 

gW _l, *«/(§). 


(The symbol [ x ] denotes the largest integer not ex¬ 
ceeding x.) 
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14. The Inequality of the Geometric and Arithmetic Means 
asserts that for any nonnegative real numbers a ly a 2 , ..., a n . 


a t + a 2 + . . . + a n 


- (aia 2 . . . a n ) 


1 In 


Prove that 


> d lld(n \ 

d\ n 


15. Use Exercise 2 of Section 6-2 and Exercise 14 above to 
prove that 




CHAPTER 7 


PRIMITIVE ROOTS 


In Chapter 4, we remarked that the elements of a reduced residue 
system form a group under the operation of multiplication. Since 
later chapters require familiarity with the structure of this group, we 
shall study in some detail the reduced residue system modulo p, 
where p is a prime. As we proceed, we shall discover an integer g 
such that g, g 2 , . . ., g p_1 constitute a reduced residue system modulo 
p; if you are familiar with abstract algebra, you can see that such a 
system is a cyclic group. The integer g is called a primitive root 
modulo p. 


7-1 PROPERTIES OF REDUCED RESIDUE 
SYSTEMS 


Example 7-1; We know that <£( 10) =4, and we observe that 
{1,3,7,9} is a reduced residue system modulo 10. Since 


3 1 = 3=3 (mod 10), 

3 2 = 9=9 (mod 10), 

33 — 27 = 7 (mod 10), 

3 4 = 81 = 1 (mod 10), 


7 1 = 7 = 7 (mod 10), 

7 2 = 49 = 9 (mod 10), 

7 3 = 343 ^ 3 (mod 10), 

7 4 = 2401 = 1 (mod 10), 


we see that each of {3,3 2 ,3 3 ,3 4 } and {7,7 2 ,7 3 ,7 4 } is a reduced residue 
system modulo 10. 

In Example 7-1, we see that 4 (=</>( 10)) is the smallest positive 
integer h for which g h = 1 (mod 10), when g = 3 or 7. 

Definition 7 - 1 : If h is the smallest positive integer such that 

a h = 1 (mod m), 

we say that a belongs to the exponent h modulo m. 
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Theorem 7-1: In order that 

a b = 1 (mod m) 

for some integer b, it is necessary and sufficient that g.c.d.(a,m) = 1. 

Proof: Let d = g.c.d.(a,m). Clearly, if d\a and d\m , then 
d | 1 and therefore d = 1 . 

If, on the other hand, g.c.d. (a, m) = 1, Euler’s theorem (Theorem 
5-2) asserts that 

a d>(m) = l ( moc l m ). ■ 

Theorem 7-2: If a belongs to the exponent h modulo m, and 
a r = 1 (mod m), 


then h | r. 

Proof: By Euclid’s division lemma (Theorem 2-1), 
r=kh + s (0 ^ s <h). 

Hence 1 = a r = a kh+s = ( a h ) k a s = a s (mod m). Thus 5 = 0, since h is 
the least positive exponent such that a h s 1 (mod m). Therefore h\r. 

Definition 7-2: If g is an integer belonging to the exponent 
</>(m) modulo m, then g is called a primitive root modulo m. 

Theorem 7-3: If g is a primitive root modulo m, then g, 
g 2 , . . ., g^ (m) are mutually incongruent and form a reduced residue 
system modulo m. 

Proof: Suppose 1 < s < r ^ <^(m) and 
g r = g s (mod m). 

Then m\g r -g\ that is, m \ g s (g r ~ s ~ 1). Hence, by Theorem 2-3, 
m | g r ~ s — 1. Consequently, g r ~ s = 1 (mod m). Thus, r — s is a positive 
integer less than 4>(m) such that 

gr-s = i ( mo d m). 

This contradicts the fact that g belongs to the exponent </>(m), and 
the theorem is proven. ® 
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Example 7-2: Theorems 7-1 and 7-3 enable us to show that 
there are no primitive roots modulo 8. Suppose that g is a primitive 
root; then, since <f>{ 8) = 4, g must belong to the exponent 4 modulo 8. 
By Theorem 7-1, we know that g.c.d.(g,8) =1; hence g must be 
congruent to one of 1, 3, 5, and 7 modulo 8. However, 

l 2 = 1 (mod 8), 5 2 = 1 (mod 8), 

3 2 s 1 (mod 8), 7 2 = 1 (mod 8). 

Therefore, g cannot belong to the exponent 4 modulo 8, and conse¬ 
quently there are no primitive roots modulo 8. 

We need not go through trial-and-error calculations, as in 
Examples 7-1 and 7-2, in order to determine the number of mutually 
incongruent primitive roots modulo m. There is a simple formula for 
this number, and this formula is the subject of the remainder of 
this section. 

Theorem 7-4: If a belongs to the exponent h modulo m and 
g.c.d. (k,h) = d , then a k belongs to the exponent h/d modulo m. 

Proof: Suppose a k belongs to the exponent j modulo m. Then 

a kj = 1 (mod m). 

Thus h | kj. Let h 1 = h/d and k x = k/d ; then 


K | k x j. 


Since g.c.d. (h,k) = d, it is true that g.c.d. (h l9 k 1 ) = 1; from this we 
see that 


hi | j. 


On the other hand, 

a khi = a hikid — a hk 1 = ( a ^)fe 1 = 1 (mod m). 

Thus j | h x . It follows that j — h x = h/d . ■ 

Corollary 7-1: If g is a primitive root modulo m, then g r is a 
primitive root modulo m if and only if g.c.d.{r,</>(m)} == 1. 

PROOF: By definition, a primitive root is an integer that belongs 
to the exponent </>(ra). By hypothesis, g belongs to the exponent 
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( j>(m ). Therefore, g r belongs to $(m)/g.c.d.{r,</>(m)}, and this 
number equals </>(ra) if and only if g.c.d.{r,</>(m)} = 1. ■ 

Theorem 7-5: If there exist any primitive roots modulo m, 
there are exactly ${<Mm)} mutually incongruent primitive roots. 

Proof: Let g be a primitive root modulo m. Then g, g 2 ,..., g* (m) 
form a reduced residue system modulo m. By Corollary 7-1, we know 
that g r is a primitive root if and only if g.c.d. { r, </> (m)} = 1. However, 
by definition of the </>-function, exactly <$>{<$> (m)} integers in the 
interval [1 ,</>(m)] are relatively prime to ■ 

Example 7-3: Let m = 10. Theorem 7-5 tells us that there are 
${<(>( 10)) = <£>(4) = 2 mutually incongruent primitive roots modulo 10 
if there are any. From our calculations in Example 7-1, we see that 
3 and 7 are two such mutually incongruent primitive roots modulo 10. 

Even though we have Theorem 7-5, we are still faced with the 
problem of determining which moduli have any primitive roots at all. 
This is the subject of the next section. 


EXERCISES 

1. Let g be a primitive root of m. An index of a number a 
to the base g (written ind^a) is a number t such that g ( = a 
(mod m). Given that a = h (mod m) and that g is a primitive 
root modulo m, prove the following assertions: 

(a) ind^a = ind g b {mod </>(ra)}, 

(b) ind^ac = ind^a + ind 9 c {mod </>(m)}, 

(c) ind^a w = nind g a {mod </>(m)}. 

2. Construct a table of indices of all integers for m = 17 and 
g = 3. 

3. Solve the congruence 9x = 11 (mod 17), using the table 
constructed in Exercise 2. 

4. Solve the congruence 4y 2 = 1 (mod 17), using the table 
constructed in Exercise 2. 

5. Solve the congruence 12x 2 — 7 (mod 17), using the table 
constructed in Exercise 2. 
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6. Find all primitive roots modulo 5, modulo 9, modulo 11, 
modulo 13, and modulo 15. 

7. Suppose g is a primitive root modulo p (a prime) and 
suppose m\p — 1 (l<ra<p — 1). How many integral 
solutions are there of the congruence 


x m g = 0 (mod p)? 


7-2 PRIMITIVE ROOTS MODULO p 

A general theorem asserts that m has primitive roots if and only if 
m is 2 or 4 or a number of the form p a or 2 p a , where p denotes an odd 
prime. We shall prove only that all primes have primitive roots. The 
complete theorem is covered in the exercises at the end of this 
chapter. 

Theorem 7-6: For each prime p> there exist primitive roots 
modulo p . 

Proof: Consider the reduced residue system 1 , 2, . . p — 1 
modulo p. Let N (h) denote the number of these integers that belong 
to h modulo p. Now we know that if a belongs to the exponent h 
modulo p, then h | p — 1. We also know that every element of a re¬ 
duced residue system belongs to some h modulo p. Consequently, 

P~ 1=2 N(h). (7-2-1) 

h Ip-1 


We shall now show that N(h) is either 0 or </>(/i). If no integer 
belongs to h modulo p, then clearly N(h) =0. If a belongs to h 
modulo p, we examine the equation 

x h = 1 (mod p). (7-2-2) 

By Theorem 5-5, we know that (7-2-2) has at most h mutually in- 
congruent solutions. However, since a, a 2 ,..., a h are mutually incon- 
gruent solutions of (7-2-2), there are exactly h solutions. Thus, any 
solution of (7-2-2) must be congruent to a r for some r. From 
Corollary 7-1, we know that a r belongs to h if and only if g.c.d .(r,h) = 1. 
Hence, among the a, a 2 , . . ., a\ there are exactly <f)(h) numbers that 
belong to h modulo p. Thus, in this case, N (h) = <})(h). 

We see, therefore, that <j>(h) ^ N (h) for all h. If there existed a 
single h for which ${h) >N(h), it would follow from (7-2-1) and 
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Theorem 6-1 that 


p-1= 2 W) < 2 = P~1. 

which is impossible. Hence, IV (/i) = 4>(h) for all h that divide p — 1. 
Thus, 4>(p — 1) integers belong to the exponent p — 1 modulo p; that 
is, there are <f>(p — 1) primitive roots modulo p. ■ 

Example 7-4: Let p — 5. A reduced residue system modulo 5 
is {1,2,3,4}. By Theorem 7-6, we know that there are primitive roots 
modulo 5; hence, by Theorem 7-5, there must be two mutually in- 
congruent primitive roots modulo 5. Since 


2 1 = 2 (mod 5), 

2 2 = 4 (mod 5), 

2 3 = 3 (mod 5), 

2 4 = 1 (mod 5), 


3 1 = 3 (mod 5), 

3 2 = 4 (mod 5), 

3 3 = 2 (mod 5), 

3 4 = 1 (mod 5), 


we see that 2 and 3 are mutually incongruent primitive roots modulo 5. 


EXERCISES (* means difficult) 

*1. Suppose g.c.d. (m,n) = 1, a h = 1 (mod m), and a k = 1 
(mod n). Prove that if j = hk/g.c.d.(/t,k), then a 3 = 1 
(mod mn). 

*2. Prove that if a is odd and n ^ 3, then a 2 ” 2 = 1 (mod2 w ). 
[Hint: Use mathematical induction on n, together with 
the equality (a 2 ” -3 ) 2 = a 2 * 1 \] 

*3. Use Exercise 2 to prove that the only powers of 2 having 
primitive roots are 2 and 4. 

4. Prove that g.c.d.{</>(ra),<Mn)} > 1 unless either morn 
equals 2 or 1. [Hint: Use Exercise 4 of Section 6-1.] 

*5. Use Exercises 1 and 4 to prove that an integer divisible 
by two distinct odd primes cannot have a primitive root. 

*6. Use the results in the previous problems to show that 
the only integers that can have primitive roots are 2,4, p r , 
and 2p r , where p is an odd prime. 
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*7. Prove that if g is a primitive root modulo p (p an odd 
prime) and g p ~ x = 1 (mod p 2 ), then (g-f p) p_1 ^ 1 
(mod p 2 ). 

8. Use Exercise 7 to prove that there exists a primitive root 
g modulo p (p an odd prime) such that g p_1 ^ l(modp 2 ). 

9. Prove that if g.c.d.(a,p) = 1, then a P m ~p m ~ 1 = \ (mod p w ), 
where p denotes an odd prime. 

*10. Let g be a primitive root modulo p (p an odd prime) with 
g p_1 ^ 1 (mod p 2 ). Prove that g ip ~ 1)pm 2 ^ 1 (mod p m ) 
for every m ^ 2. [Hint: Proceed by mathematical induc¬ 
tion on m, using Exercise 9 and the relation g^ p -^ p7n ~ 2 = 

(g (P - 1 )p m “3 ^ p j 

11. Prove that if g is a primitive root modulo p (p an odd 
prime), then g belongs to h modulo p m , where /i = 
(p — l)p r for some r. 

12. Use Exercises 10 and 11 to prove that if g is a primitive 
root modulo p (p an odd prime) and g p_1 ^ 1 (mod p 2 ), 
then g is a primitive root modulo p m . 

13. Prove that some odd numbers are primitive roots modulo 
p m for each odd prime p and each positive integer m. 

14. Prove that each odd primitive root modulo p m (p an odd 
prime) is a primitive root modulo 2 p m . 

15. How many primitive roots exist for the moduli 6, 7, 8, 9, 
and 10? 

16. To what exponents do 1, 2, 3, 4, 5, 6, 7, 8, 9, and 10 
belong modulo 11? In this particular case, verify that 
N ( h) = 4>{h) for all h that divide 10. 
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As you learned in Chapter 2, primes play a fundamental role in 
the theory of numbers. In fact, some of the most striking results in 
number theory, the Quadratic Reciprocity Law, Wilson’s theorem, and 
Fermat’s little theorem, reveal interesting properties of primes. 

Gauss was the first to give careful attention to tt(x) , the number of 
primes that do not exceed x. Observing that the values of i r(x) were 
well approximated by x/log x*, he conjectured that 

lim 7T —~ = 1. (8-0-1) 

x * 00 X 

log X 

J. Hadamard and C. de la Vallee Poussin independently proved 
(8-0-1) in 1896, using some very sophisticated techniques in the 
theory of complex variables. This result has become known as the 
Prime Number Theorem. Although, in 1948, Atle Selberg and Paul 
Erdos proved the Prime Number Theorem without complex variables, 
theirs and all other proofs known today are intricate and long. 

Consequently, after examining some simple results concerning 
7 r(x), we shall prove a theorem which, while resembling the Prime 
Number Theorem, is far easier. This result of Tchebychev asserts 
that there exist positive numbers c x and c 2 such that 

Cl log* < 7r(x) < Cz logic’ (8-0-2) 


for all x ^ 2. 


8-1 ELEMENTARY PROPERTIES OF tt(x) 

We have not yet established the existence of infinitely many 
primes, a problem that is easily settled with Euclid’s elegant proof. 

^Throughout the text, log x = log e %. 
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Theorem 8-1: lim tt(x) = +<*>; that is, there exist infinitely 

X-*oo 

many primes. 

PROOF: Assume that there are only finitely many primes, say 
Vu P 21 • • - i Pn • Let M = PiP 2 . . . p n 4-1. Clearly, if we divide M by 
any of p l9 p 2 , . . p n , the remainder is 1. Thus, by our hypothesis 
that there are only finitely many primes, we deduce that M has no 
prime factorization. Since this contradicts the fundamental theorem 
of arithmetic, the only possible conclusion is that there are infinitely 

many primes. Thus lim 7 r(x) = + 00 . ■ 

*-> 00 

On the other hand, it is clear that 7 t(x) ^ x. Actually tt(x) is 
much smaller than x. This is reflected in the following three theorems. 

Theorem 8-2: If k is any positive integer , 

*(*) ^ <t>(k) ( 2k 
x k x 


PROOF: Suppose [x] = kl + r, where 0 ^ r < k, and where [x] 
denotes the largest integer not exceeding x. We divide the integers of 
[l,x] into l sets of k consecutive integers plus a remaining set of the r 
integers kl + 1, kl 4- 2, . . ., kl + r. 

Among the integers 1, 2, . . ., k there are obviously at most k 
primes. Among the integers k + 1, k + 2, . . ., 2k, there are at most 
fi(k) primes, since any integer not relatively prime to k has a prime 
factor in common with k that is less than or equal to k. Similarly, in 
each of the remaining sets of k consecutive integers, there are at most 
<l>(k) primes. Finally, in the remaining set of r integers, there are at 
most r primes. Consequently, 

tt (x) ^ k+ (Z-l)<Mfc) + r= £ 2fc+-!<Mfc). 

Hence 

iW <M 1 ^ _ 

X ~ k X ' ■ 


The next result allows us to estimate the size of fi{k)/k. 

Theorem 8-3: IfM > 1 and p l9 p 2 , . . ., p s are all the primes in 
(1,2, . . .,M), then 


V — < 

^ rt 
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Remark: From Theorem 8-3, we may deduce as follows that 
00 1 1111 1 

the infinite series V — = «+ o +k + ^ + • • diverges: If we let 

& Pi 2 3 5 7 11 

M -» oo in Theorem 8-3, we see that Y[ (l —= 0. Therefore, 

i=l \ Pi' 

Y — diverges, by Theorem B-2 of Appendix B. 

i=l P l 


Proof: From the formula for the sum of a geometric series, we 
know that for each prime p 


1 



i+^+Vt 

P P 2 P 6 


Consequently, 



where A is the set of all integers each of whose prime factors is at 
most M. That the last equality holds is clear, since multiplying these 

series together gives all possible terms of the form ai a2 

Pi F2 •••Ps 

with Pi ^ M. H 

While the next theorem shows that tt(x) is much smaller than x , 
we require still more groundwork before proving Tchebychev s 
result, which explicitly describes the relative smallness of tt(x). 

Theorem 8-4: lim - ^ — = 0. 

x-»<» X 

Proof: Since ^ 0 for x > 0, we need only show that we 
x 

can make tt(x)/x arbitrarily small by choosing x sufficiently large. 
Theorem 8-2 has established that 
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Tr(x) < (t>(k) + 2k 
x k x 

for each positive integer k. Let M be a large integer, and let 
k = P\P 2 ■ ■ - P s , where {pi,p 2 , . . . p s } is the set of all primes not 
exceeding M. Then 

<Mk) . k { 1 ~'p){ 1 ~p') ' ' ' ( 1- ^) 
k k 

Thus 

Zl*l < / V IV 1 + 2piP * • • • P * (8-1 -1) 

x V^i n) x 

It is now a simple matter to make 7r(x)/x as small as we like. 

00 1 

Since V — is a divergent series, we can choose M so large that 

n = i n 
M 1 £ 

V — > -, where e > 0 is an arbitrary positive number. Then, for 

n =1 n € 

x > 4pip 2 ♦ ■ - P s ^ 


7T(X) ^ } 2p x P2 . . . p s € _ £ m 

x 2 4 p x p 2 . . . Vs 

To conclude our section on elementary properties of 7r(x), we 
now present two additional results indispensable to proving (8-0-2). 

Theorem 8-5: If [x] denotes the largest integer that does not 
exceed x, then 0 ^ [2x] — 2[x] ^ 1. 

PROOF: The two inequalities 

2x — 1 < [2x] ^ 2x, 

and 

2x — 2 < 2[x] ^ 2x 

are direct consequences of the definition of [x]. Thus 


—1 < [2x] —2[x] <2. 
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However, [2x] — 2[x] is an integer, and the only integers in the 
interval (—1,2) are 0 and 1. Thus 0 ^ [2%] — 2[x] ^ 1. ■ 


Theorem 8-6: If p is a prime, then 
p appearing in the prime factorization ofnl. 


is the exponent of 


PROOF: Note that if p > n, then p does not appear in the prime 
factorization of n! and every term in ^ is zero as desired. 

lip ^ n, 
namely 

p, 2 p, 3 p,. . p. 


then ~ integers in {1,2, ...,n} are divisible by p, 


Of these integers, 



are again divisible by p : 


p 2 ,2 p 2 , . . p 2 . 

By the same logic, of these are divisible by p a third time: 

p 3 , 2 p 3 , . . ., p 3 . 


After finitely many repetitions of this argument, we see that the total 
number of times p divides numbers in {1,2, ...,n} is precisely 



consequently, this sum is the exponent of p appearing in the 


prime factorization of n\. 


EXERCISES 

1. The Fermat numbers are numbers of the form 2 2 ” +1 = O n . 
Prove that if n < m, then <I>„ | 3> m — 2. 

2. Prove that if n ^ m, then g.c.d.(<I> w ,<& m ) = 1. 
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3. Use Exercise 2 to give a new proof that there exist in¬ 
finitely many primes. 

4. Modify the proof of Theorem 8-1 to prove that there exist 
infinitely many primes congruent to 5 (mod 6). 

5. Modify the proof of Theorem 8-1 to prove that there exist 
infinitely many primes congruent to 3 (mod 4). 

6. Suppose that p l9 p 2 , . . ., p r are the only primes congruent 
to 1 (mod 4). Prove that4px 2 p 2 2 • • • Vr + 1 is divisible only 
by primes congruent to 3 (mod 4). 

7. Assuming that all odd prime factors of integers of the form 
x 2 + 1 are congruent to 1 (mod 4), use Exercise 6 to prove 
that there exist infinitely many primes congruent to 
1 (mod 4). 

8. Let a i = 2, and for each n > 1, let a n be the least integer 
for which g.c.d. {a u a n ) = 1 for each positive i < n. What 
is a 2 ? a 3 ? a 4 ? a 5 ? a 6 ? a n ? 

9. Use the proof of Theorem 8-1 together with mathematical 
induction to prove that the nth prime p n is less than 
2 2 ” + 1. (Alternatively, use Exercise 3.) 

10. Prove (using Exercise 9) that 

7t(x) > C loglog X 
for some absolute constant c. 

11. (a) Prove that every integer can be factored uniquely into 

the product of a square-free number (see Exercise 13 
of Section 6-2) and a perfect square. 

(b) Prove that if only r primes existed, then there would 
be exactly 2 r square-free numbers. 

(c) From (a) and (b) deduce that if there were only r 
primes, then there would be at most 2 r Vn integers not 
exceeding n. Show that this is impossible for n suffi¬ 
ciently large. 

(d) Deduce from (c) that 
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7T(x) > C log X, 
for some absolute constant c. 

12. Let 6(x) be the sum of the logarithms of all the primes not 
exceeding x. Prove that 

0(x) ^ 7 T(x) fog X. 

13. Suppose there were only finitely many primes 2,3,5,... ,p. 
Let M denote the product of all the primes. By evaluating 
</>(M), prove that there exist primes not dividing M. 

14. Find the smallest positive integer for which x 2 —x + 41 is 
not a prime. 

15. Find the prime factorization of 

2,432,902,008,176,640,000 = 20!. 

16. How many zeros are there at the end of the base 10 repre¬ 
sentation of 132! ? at the end of the base 2 representation? 

17. Does = p~J f° r each real x and each integer n > 1? 

18. Prove that 0 ^ [nx] — n[x] ^ n — 1 for each real number 
x and each integer n ^ 1. 


8-2 TCHEBYCHEV’S THEOREM 

In order to establish Tchebychev’s result (8-0-2), we must 
examine some elementary properties of the function x/log x. 

Theorem 8-7: Iff(x) = x/log x, then 


f(x) is increasing for x > e. 

(8-2-1) 

f(x - 2) > | f(x) for x a 4, 

(8 -2-2) 

/( 2 )< 16 f(x)forx^8. 

(8 -2-3) 
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PROOF: Since 


/'(*) = 


log x — 1 


(logx) 2 ’ 

we see that/'(x) > 0 for x > e; this establishes (8-2-1). 

To prove (8-2-2), we note that if x — 4, then x—2^ x/2; thus, 

f(x-2) - log(x _ 2 ) - 2 log(x — 2) 2 log x — 2 X ' 

To prove (8-2-3), we note that if x ^ 8, then x/2 ^ x 2/3 , and 
x + 2 ^ 5x/4 ; hence, 


/(^) 


< 


x 4- 2 


x + 2 


5x/4 15 C( v 

= TafM- 


2 log (x/2) 2 log x 2/3 4 log x 16 

3 


We are now ready to prove Tchebychev’s theorem. 
Theorem 8-8: For x£8, 
log 2 


* < w(x) < 30(log 2) * 


log X 


logx ‘ 


Proof: Let us examine the binomial coefficient, 




the num¬ 


ber of combinations of 2n things taken n at a time. We recall from 
Theorem 3-2 that 

(2n\ (2nl) 2n(2n-l) , . . (n+J) 

\nj (n!)(n!) n(n — 1) . . . 1 

Now, any prime p in the interval (n,2n] must appear as a factor in 
the numerator of ; since it cannot appear in the denominator 

(it is larger than n), we see that p Hence, multiplying all such 

primes together, we find that 

( 2 ;). 


where P n denotes the product of all primes larger than n but not 
exceeding 2 n. 
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Thus, since each prime appearing as a factor of P n is larger than n, 
and since there are 7r(2n) — 7r(n) prime factors of P rt , we see that 


n n(2n)-7r(n) <? n < 



(8-2-4) 


On the other hand, suppose that corresponding to each prime p 
we define r p by the inequalities p r p ^2 n < p r p +l . Using Theorem 
8-6 to determine what power of p appears in the prime factorization of 


(*;) , we see that the correct exponent is the power of p appearing in 
(2n)! minus the power of p appearing in (n!) (n!); in other words, the 
exponent is ~p 5 )* Theorem 8-5, 


0 < 



-2 



1 = r p . 


Hence, we see that 



where Q n denotes the product of all p r p. Since each p r p does not ex¬ 
ceed 2n, and since Q n has 7r(2n) factors of the form p r p. 


( 2 J*) < Q n < (2 n) n{2n> . (8-2-5) 

As soon as we determine the size of (^*) > we shall see that 

Tchebychev’s theorem may be deduced from (8-2-4) and (8-2-5). 

By the binomial theorem (Exercise 10 of Section 3-1), 

(l + x) 2 ” = l + ( 2 1 n )x + ( 2 2 n )^+. . , + ( 2 n n )*» + . . . + 


Hence, with x = 1, we find that 

2 " - 1 + CD + CD + ■ ■ • + CD ■ + " - +1 > CD- 18 - 2 - 61 

On the other hand, 


/2n\ _ 2 n 
\n ) n 


2n (2 n — 1) (2n — 2) (n + 1) 
(n — 1) (n — 2) * 1 


> 2 ■ 2 • 2 . . . 2 = 2 W . 


(8-2-7) 
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Combining (8-2-5) with (8-2-7), we find that 

2 n < (2n) 7r(2w) . (8-2-8) 

Taking logarithms of both sides of (8-2-8), we obtain the inequality 
n log 2 ^ 7r(2n) log 2n. (8-2-9) 

Thus, if x ^ 5 and/(x) = x/log x, then by (8-2-9), (8-2-1), and (8-2-2), 


7T{X) — 7 T (2 | ^ 


log 2 


log 2 


_ log 2 x 
4 log x ’ 

and this gives the left-hand inequality of Theorem 8-8. 

To obtain the other half of Tchebychev’s theorem, we combine 
(8-2-4) with (8-2-6). Thus, 

n 7r(2n)-ir(n) < £2n (8-2-11) 

Taking logarithms of both sides of (8-2-11), we find that 
(7r(2n) — 7r(n)) log n <2n log 2. 


Hence, 


7r(2n) < (2 log 2) 


n 


■ 4- 7r(n). 


log n ' '* x (8-2-12) 

We may now establish by mathematical induction that 


77 (2n) < 32 (log 2) 


log n 


, for n > 1. 


First, we note that (8-2-13) is true for 2 ^ n ^ 8: 

77(4) =2 < 77(6) =3 < 77(8) =4= 77(10) =4 

< 77(12) = 5 < 77(14) = 6 = 77 ( 16 ) = 6 < 64 
2 


= 32 (log 2) 


log 2* 


(8-2-13) 
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Now assume (8-2-13) for all integers n ^ k, where k 2= 8; then, 
by (8-2-12) with/(x) = x/log x, 

ir(2k + 2) < 2 (log 2) f{k + 1) + tr{k + 1) 

S 2 (log 2 )f(k + 1) + 7T (2 [^]) 

< 2 (log 2) f(k + 1) + 32 (log 2) / ([^p]) 

— 2 (log 2)/(fc + l) +32 (log 2)/(pp) 

< 2 (log 2 )f(k + 1) + 32 (log 2) || f(k + 1) (by (8-2-3)) 

= 32 (log 2) f(k + 1) = 32 (log 2) log ^ 1} • 

Hence, 

ir(2n) < 32 (log 2) for all n > 1. 

Thus, for each real number x ^ 8, 

tt(x) < 77(2 | + 2 ) <32(log2)/([|] + l) 

^32 (log 2)/(pp) 

< 32 (log 2) || f(x) (by (8-2-3)) 

= 30 (log 2) f(x) 

= 3° dog 2)^. ■ 


EXEBCISES 

1. Deduce from Theorem 8-8 that if x is sufficiently large, 
there exists a prime between x and 125x. 


Prove that if n ^ 4 and 2n/3 < p ^ n, then p 



3. From the definiti on of r p on page 108, deduce that if 
r p — 2, then p — V2n. 
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In Exercises 4 and 5, assume that there are no primes p such 
that n < p ^ 2 n. We denote the product of all primes not 
exceeding x by R x . 

4. Using the assumption above and the first inequality in 
(8-2-5), prove that 

( 2 n ") S (2„)v® V 

5. From (8-2-7) and the assumption that R x ^ 4 X for each 
real x , deduce that the inequality in Exercise 4 is impossi¬ 
ble if n is large. 

6. Assuming for each real x that R x ^ 4 X , prove Bertrand’s 
postulate: there exist primes between n and 2n for n suffi¬ 
ciently large. 

7. Using Exercise 12 of Section 8-1, prove that 

0(x) < 30 (log 2)x. 


8-3 SOME UNSOLVED PROBLEMS ABOUT PRIMES 

Although primes form the multiplicative building blocks of the 
integers, many seemingly elementary questions about them are yet 
unanswered. 

For example, in a letter to L. Euler (1742), C. F. Goldbach con¬ 
jectured that: every even number larger than 2 is the sum of two 
primes . It is simple to show that the statement is true in the case of 
small numbers; for example, 4=2 + 2, 6=3 + 3, 8 = 5 + 3, 10=7+3 = 
5 + 5, 12 = 7 + 5, 14 = 11+3 = 7 + 7, 16 = 13 + 3 = 11+5,.... How¬ 
ever, whether the statement is true for all even integers is still un¬ 
settled. Nevertheless, it is supported by existing evidence. A Russian 
mathematician, I. M. Vinogradov, proved that all large odd integers 
are the sum of three primes. Surprisingly, his techniques involve ex¬ 
tremely subtle use of the theory of complex variables; no one has 
been able to extend them in order to solve Goldbach’s conjecture. 

Also unsolved is the famous Twin Primes Problem: are there 
infinitely many primes p such that p + 2 is also a prime ? Thus 3 and 
5, 5 and 7, 11 and 13, 17 and 19 are all examples of twin primes. 
Numerical evidence makes it plausible that infinitely many such 
pairs exist. 




112 


PRIME NUMBERS 


Finally, we mention the Mersenne Primes ; that is, prime numbers 
of the form 2 P — 1 where p is also a prime. We have already met such 
numbers in Section 3-5 in connection with perfect numbers. M. 
Mersenne asserted in 1644 that 2 P — 1 is prime for p = 2,3,5,7,13,17, 
19, 31, 67, 127, 257, and for no other primes p < 258. Actually, 
2 67 — 1 and 2 257 — 1 are not primes while 2 61 — 1, 2 89 — 1, and 2 107 — 1 
are; however, it is quite surprising that Mersenne —almost 300 years 
before the invention of the modem electronic computer—had only 
five mistakes in his list. In 1963, D. Gillies showed that the primes p 
not exceeding 12143 for which 2 P — 1 is also prime are 2, 3, 5, 7, 13, 
17, 19, 31, 61, 89, 107, 127, 521, 607, 1279, 2203, 2281, 3217, 4253, 
4423, 9689, 9941, and 11213. Recently B. Tuckerman proved that 
2 P — 1 is prime for p = 19937. It is not known whether there exist in¬ 
finitely many Mersenne primes. 

Some other unsolved problems about primes, such as the Riemann 
hypothesis, require considerable background even for the compre¬ 
hension of their statements. The three problems we have described 
are among the best known and easiest to understand. 




PART II 


QUADRATIC 

CONGRUENCES 


In Part I, we saw the importance of congruences to the 
study of multiplicative questions, and we completely solved 
the linear congruence 


ax = b (mod c). 

Part II is devoted to a more advanced study of congru¬ 
ences involving quadratic polynomials. Perhaps the simplest 
of these is 


x 2 = a (mod p) 

where p is a prime. In examining this congruence, we shall 
encounter Gauss's celebrated Quadratic Reciprocity Law, and 
we shall acquire information important to the additive prob¬ 
lem: in how many ways can an integer be represented as a 
sum of two squares? Part III contains an extensive investiga¬ 
tion of this question. 

As you can see, therefore, our study of quadratic congru¬ 
ences serves to link the multiplicative to the additive aspects 
of number theory. 
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CHAPTER 9 


QUADRATIC RESIDUES 


In our study of congruences, we discussed the circumstances 
under which 


ax = b (mod c) 

has solutions. The next simplest congruence is 

x 2 = b (mod n). (9-0-1) 

The ability to solve (9-0-1) will in most cases enable us to determine 
whether a quadratic congruence of the form 

ax 2 + bx + c = 0 (mod d ) 


has solutions. 


9-1 EULER’S CRITERION 

Our first step is to develop a test for determining whether there 
exists an integer x such that 


x 2 = a (mod p), (9-1 -1) 

where p is a prime and g.c.d.(a,p) = 1. If p^[a and (9-1-1) has a solu¬ 
tion, we shall say that a is a quadratic residue modulo p. 

Example 9-1: Let p = 7. Since 1, 4, and 9 are perfect squares 
not divisible by 7, they are quadratic residues modulo 7. Any integer 
congruent to one of these squares modulo 7 is also a quadratic residue 
modulo 7; hence —6, 2, and 11 are all quadratic residues modulo 7. 
Although 49 is a perfect square, it is not a quadratic residue modulo 7 
since 7 | 49. 
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Theorem 9-1: The number a is a quadratic residue modulo p 
if and only if 


p -1 

a 2 =1 (mod p). 


Proof: Suppose that a is a quadratic residue modulo p. LetX 
be any integer such that 


X 2 = a (mod p). 

Since p~|~a, we see that p~t~X. Consequently, 

a~^ = (X 2 )^ = X*' 1 = 1 (mod p). 


by Euler’s theorem (Theorem 5-2). 

On the other hand, suppose that 

p-i 

a 2 =1 (mod p). 


Let g be a primitive root modulo p. Then there exists an integer r such 
that 

g r = a (mod p), 

and so 

grip - 1)/2 = a 2 = l (mod p). 

But, from Theorem 7-2, we see that p — 1 | r(p — l)/2. Thus, r/2 
must be an integer; that is, r = 2s, where s is an integer. Hence, if 
x = g s , then 


x 2 = g 2s = g r = (mod p); 

this establishes Euler’s Criterion. B 

This proof furnishes a useful corollary. 

Corollary 9-1: Let g be a primitive root modulo p, and 
assume g.c.d.(a,p) = 1. Let r he any integer such that g r = a (modp). 
Then r is even if and only if a is a quadratic residue modulo p. 

EXERCISE 

1. Use Euler’s Criterion to determine whether a is a quadratic 
residue modulo p in each of the following instances: 
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(a) a = 2,p = 5; 

(b) a = 4, p = 7; 


(c) a = 3, p= 11; 

(d) a = 6, p = 13. 


9-2 THE LEGENDRE SYMBOL 


We shall now introduce the Legendre symbol (^), which greatly 
simplifies computations in problems on quadratic residues. 


Definition 9-1: If p is an odd prime, then 



if a is a quadratic residue modulo p, 
if p\a, 
otherwise . 


Theorem 9-2: If p is an odd prime and a and h are relatively 
prime to p, then 


(;>) ” ip)’ " b !nl0 ' d ,9 ' 2 ' n 

(?)-(§)©• 

PZ i! /n\ 

0,2 - ( mod ?)• (9-2-3) 

PROOF: Statement (9-2-1) follows directly from the definition 
of (~j> while (9-2-2) follows from Corollary 9-1. 

As for (9-2-3), we see that if a is a quadratic residue modulo p, 
then Theorem 9-1 implies that 


a 2 = 1 = (^) (mod p). 

If p | a, then 

a 2 = 0 = (f) ( mod P)- 

Finally, if p fa, then 

p -i 

a 2 = ±1 (mod p), 
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since 


a p 1 = 1 (mod p). 

Thus, if g.c.d. (a,p) = 1 and a is not a quadratic residue modulo p, we 
see that 

a~ = -1 = (^) (mod p). ■ 

For the following exercises, we extend the definition of the 

symbol to include the case where m is any odd number: 

If m = PiP 2 . . • p r where the p t are odd primes (not necessarily 
distinct), then 



This extended symbol is called the Jacobi symbol. 

EXERCISES 

1. Prove that if c is odd, then (~~J = 

2. Prove that if b and c are odd, then = • 

3. Prove that if a = b (mod c), where c is odd, then 

9-3 THE QUADRATIC RECIPROCITY LAW 

In this section, we shall prove a famous theorem of Gauss, the 
Quadratic Reciprocity Law, which enables us to solve almost all 
quadratic congruences. This is not an easy theorem; great mathe¬ 
maticians such as L. Euler and A. Legendre were baffled by it. It is 
some measure of the greatness of C. F. Gauss that he proved it when 
he was only nineteen years old. Because this theorem is of funda¬ 
mental importance in number theory. Gauss returned to study it 
many times throughout his life, and he gave at least six different 
proofs.* 



*That this theorem has occupied the attention of many mathematicians is amply 
borne out (although slightly exaggerated) in the title of an article by M. Gerstenhaber: 
“The 152nd Proof of the Law of Quadratic Reciprocity,” American Mathematical 
Monthly, 70(1963), 397-398. 
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Our first goal, however, is to prove a preliminary theorem called 
Gauss’s Lemma. 

Definition 9-2: If n is any integer , then the least residue of n 
modulo m is the integer x in the interval (—m/2, m/2] such that 
n = x (mod m). We denote the least residue ofn modulo m by LR m (n ). 

Example 9-2: The set {—5,—4,—3,—2,—1,0,1,2,3,4,5} is a com¬ 
plete set of least residues modulo 11. Thus LR n (2l) = — 1, since 
21 = — 1 (mod 11); similarly, LR n { 99) = 0, and LH 11 (60) = 5. 

Definition 9-3: We define sgn{x) {read signum of x) by 

f+1 if x > 0, 
sgn(x) = \ 0 ifx = 0, 

[—1 if x <0. 

In general, we note that x= \x \ sgn(x). 

Theorem 9-3 (Gauss's Lemma): Let g.c.d.(m,p) = 1 where p 
is an odd prime , and let p be the number of integers in the set 

jm, 2m, . . . ,|(p-l)mj 

whose least residues modulo p are negative. Then 


Proof: First note that none of the integers m, 2m,..., ^ {p — l)m 
is divisible by p. Now, for any n, 

nm = LR p (nm) = sgn (LR p (nm)) | LR p (nm) | (mod p). 

Since 0 < | LR p (nm) | < p/2, we see that as n takes all integral values 
in (0,p/2), so does \ LR p (nm) |. Consequently, 

m(2m)(3m) . . . ((p —l)m/2)= sgn (LR p (m)) sgn(LR p (2m)) 

. . . sgn(LR p ((p — l)m/2)) • |LR p (m) | * \ LR p (2m) \ 
... | LR p ((p - l)m/2) | (mod p), 
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or 

(| (p - 1)) ! m~ = sgn (LR v (m)) sgn(LR p (2m)) 

. . . sgn (l,R p (| (p - l)m)) • (| (p - 1)) ! (mod p). (9-3-1) 

Since Q (p — 1)) ! is relatively prime to p, we may cancel it from 
both sides of (9-3-1), by Theorem 4-3. Hence, 

m 2 s sgn (LR p (m)) sgn (LR p (2m)) ... sgn (l,R p ^ (p — l)m^ 

(mod p). 

But 

= (^) (mod p), 

by Theorem 9-2, and 

(-1)" = sgn (LR p (m)) sgn(LR p (2m)) . . . sgn (lR p (| (p — l)m)). 
Thus, 

(y) = (-1) M (mod p). 

Since each side of this congruence equals either +1 or —1, and 
since p > 2, 

( f )-(- 1 »'• ■ 

We are now ready to prove Gauss’s Quadratic Reciprocity Law. 

Theorem 9-4 (Quadratic Reciprocity Law): If p and q are 
distinct odd primes, then unless p = q = 3 (mod 4), in 

which case ^ . 


REMARK: This theorem may be stated equivalently in the form 



9-3 THE QUADRATIC RECIPROCITY LAW 
PROOF: Let p, denote the number of integers in the set 
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q, 2q, . . x (p ~ 1) <7 


with negative least residues modulo p. Let p 2 denote the number of 
integers in the set 


p, 2p, . . .,|(q-l)pj 


with negative least residues modulo q . Gauss s Lemma implies that 
(?) = (-1)"* and (^) = (-IT 1 . Then, since (|) = (j) if and only if 

(~) = T w e see that (-) = (—) if and only if 


(-Do.+c^l, 

Thus, to prove our theorem, we must show that pj 4- p 2 is odd if and 
only if p = q = 3 (mod 4). 

We proceed geometrically. We shall count in two ways the lattice 
points (that is, the points whose coordinates are integers) inside a 
certain hexagon. The first count will show that there are an odd num¬ 
ber of lattice points in the hexagon if and only if p = q = 3 (mod 4). 
The second count will show that there are pi + p 2 lattice points inside 
the hexagon. The two counts together will show that ^ + p. 2 is odd if 
and only if p = q = 3 (mod 4). 

In the first quadrant of the (x,y) -plane, we consider the hexagon 
H with vertices ABCDEF that lie on the rectangle AGDJ bounded by 
the coordinate axes and the lines x = p/2 and y = q/2, as shown in 
Figure 9-1. 


EF is defined by 




and BC is defined by 


p ,1 

X= q» + 2- 


The line BC has x-intercept ,0), and EF has (/-intercept 
both are parallel to the diagonal AD. 
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The coordinates of the points * (x,y) that lie in the interior of H 
must satisfy the inequalities 


0 < x < p/ 2 , 0 < y < ql 2 , 


(9-3-2) 


y < p X + b 


y > — x — ^~. 
p 2 p 


Note that if (m,n) is any lattice point in the interior of H, then so is 


(H 1 


m. 


Q + 1 


; we can verify this by substituting these 


coordinates into the four inequalities of (9-3-2). Now we see that 

, \ (p + 1 q + 1 \ 

(m,n) = g-m, —^-nj 


if and only if m — — - and n = — ^ ; and P: ^ " 4 a 

lattice point if and only if p = q = 3 (mod 4). Thus the pairing of 
(m,n) with — m, ^ ^ ^ shows that the number of lattice 

points in H is odd if and only if p = q = 3 (mod 4). 


*The reader is reminded here that (x,t/) is a point and is not to be confused with 
g.c.d.(x,i/). 


a|cq 
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Now let us consider the lattice points in H from a different point 
of view. It is clear that there are no lattice points on the diagonal 


y = — x. If (m,n) is a lattice point below the diagonal, then 


that is, 


L <n< am. 

p 2p p 


— £ < np — qm < 0. 


(9-3-3) 


We see that in this case np has a negative least residue modulo q , 
namely np — qm. Conversely, if np has a negative least residue, we 
can produce an m such that (9-3-3) holds; hence, (m,n) is in H and 
lies below the diagonal. Thus, since p 2 is the number of integers in 


the set 


P, 2p, 


(q-i)pj 


with negative least residues modulo 


q , there are p 2 lattice points in H below the diagonal AD. 

In exactly the same way, we find p t lattice points in H above the 
main diagonal. 

We conclude that p x + p 2 (the total number of lattice points in H) 
is odd if and only if p = q = 3 (mod 4); thus the Quadratic Reciprocity 


Law is proved. 


To complete our tools for solving quadratic congruences, we need 
the following consequence of Gauss’s Lemma (Theorem 9-3). 


Theorem 9-5: If p is an odd prime , then 



(9-3-4) 

(9-3-5) 


Proof: By Gauss’s Lemma, with m = —1, we see that p — 
- (p —1); this establishes (9-3-4). 

4i 


With m = 2, p is the number of integers in the set 2,4, . . .,p— 1 
whose least residues modulo p are negative, which is the same as the 

number of even integers in the interval ^ , p — 1 j . Thus, 


2s whenp = 8$ + l, 

2s + 1 when p = 8s + 3, 

2s + 1 when p = 8s + 5, 

2s + 2 when p = 8s + 7. 
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Hence, /a is even if and only if p = ±1 (mod 8). Consequently, 


/2\ _ f 1 when p = ±1 (mod 8), 

\p) 1 when p = ±3 (mod 8). 


Now, since 


f 1 when p = ±1 (mod 8), 
1 when p = ±3 (mod 8), 


we see that (9-3-5) is established. 


EXERCISES (In Exercises 1 through 3, the symbol appearing is the 
Jacobi symbol, defined prior to the exercises for Section 9-2). 

1. Prove that if c is odd, then = ( — 1)^ ° 1 * 

2. Prove that if c is odd, then = (—1) (c2 “ 1)/8 . 

3. Prove that if a and c are odd, then (^j = (—1) i U 1) c D . 

4. Use Gauss's Lemma to show that 17 is a quadratic residue 
modulo 19. 

5. 


6 . 

7 . 


Using the Quadratic Reciprocity Law, prove that 

/3\ | 1 if p = 1 or 11 (mod 12), 

\p/ [— 1 if p = 5 or 7 (mod 12) 

for each odd prime p. 

Is it possible that = 1 while the congruence x 2 = n 
(mod m) has no solution? Prove your answer. 

Does the congruence x 2 =631 (mod 1093) have any solu¬ 
tions? [Hint: Use the Jacobi symbol.] 
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9-4 APPLICATIONS OF THE 

QUADRATIC RECIPROCITY LAW 

We shall limit this discussion to solutions of quadratic congru¬ 
ences with odd moduli. In the following theorem, we consider such 
congruences for prime power moduli. 

Theorem 9-6: If p is an odd prime and g.c.d.(a,p) = 1 , then 
the congruence 


x 2 = a (mod p n ) (9-4-1) 

has a solution if and only if = 1. 

PROOF: First note that if (^j =—1, then the congruence (9-4-1) 

cannot possibly have solutions, because solutions of ( 9 - 4 - 1 ) auto¬ 
matically satisfy the condition 

x 2 = a (mod p). 

Suppose that (fj = 1 . We establish solutions for (9-4-1) by math¬ 
ematical induction. Since (fj = 1* (9-4-1) has solutions when n — 1 . 

Assume now that (9-4-1) has a solution x (} for n = k. Then, there 
exists an integer m such that 


x<? — a — mp k . 

If x 0 is an inverse of x 0 modulo p, then x 0 x 0 = 1 + rp. Hence, 
(x 0 — mx 0 | (p + 1 )p k ) 2 ~ a 

= x 0 2 -m(p + 1 )p k x 0 x 0 + m 2 x 0 2 (| (p + 1)) V* - a 
= x 0 2 -m(p + l)p fc (l + rp) + m 2 x 0 2 (| (p + 1 ))V* - a 
= —mp k+1 + m 2 x o 2 (| (p + 1)) V* ~ mr(p + l)p k+1 

= p k + 1 (-m- mr(p- 1-1) + m 2 x 0 2 (| (p + l))V -1 )- 
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Therefore, (9-4-1) has a solution for n = k + 1, and this establishes 
our theorem. H 

Let us study specific congruences: We begin with 

x 2 = 15 (mod 89). (9-4-2) 

Now, 

= (y) (n|) (by Theorem 9-4) 

- (!) (I) (by (9 - 2 - 1 » 

= (-i)i=—i. 


Thus (9-4-2) has no solutions. 

Next we consider the congruence 

x 2 ^ 12 (mod 2989). (9-4-3) 

Since 2989 = 7 2 • 61, where both 7 and 61 are primes, the Chinese 
Remainder Theorem and Theorem 9-6 tell us that (9-4-3) has solu¬ 
tions if and only if 


and 


x 2 = 12 (mod 7) 


x 2 = 12 (mod 61) 


(9-4-4) 

(9-4-5) 


have solutions. Now 


and 




Hence, (9-4-4) has no solutions, and therefore (9-4-3) has no 
solutions. 
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EXERCISES 

1. Does x 2 = 17 (mod 29) have a solution? 

2. Does 3x 2 = 12 (mod 23) have a solution? 

3. Does 2x 2 = 27 (mod 41) have a solution? 

4. Does x 2 +5x = 12 (mod 31) have a solution? [Hint: Com¬ 
plete the square.] 

5. Does x 2 = 19 (mod 30) have solutions? [Hint: Use the 
Chinese Remainder Theorem.] 




CHAPTER 10 


DISTRIBUTION OF 
QUADRATIC RESIDUES 


This chapter introduces a general counting technique in number 
theory from which we shall derive sharp and surprising results, with 
by-products that will be valuable in Chapter 11. 


10-1 CONSECUTIVE RESIDUES AND 
NONRESIDUES 

Corollary 9-1 implies that among the set of integers in the closed 
interval [l,p —1] (where p is an odd prime) there are (p —1)/2 
quadratic residues and (p —l)/2 nonresidues modulo p. Now, how 
many consecutive pairs of quadratic residues are there? 

If we can argue that there is equal likelihood for a particular 
integer to be a quadratic residue or a nonresidue, approximately one- 
fourth of the consecutive integer pairs in [l,p —1] should be con¬ 
secutive quadratic residues. We shall see that our estimate is correct. 

Theorem 10-1: N(p) (p — 4 — (— l) (p ~ 1)/2 ), where N(p) 

denotes the number of pairs of consecutive quadratic residues 
modulo p in [l,p — 1]. 

PROOF: Let c p (n ) be defined by the formula 

f 1 if both n and n+ 1 are quadratic residues modulo p, 
Cp(n) 10 otherwise. 

Then, 

N(p) =*2 c ^ n) - (10-1-1) 

n — 1 
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We note that 


c,<n)«i(l + (2))(l + (£±±)). (. 0 - 1 - 2 ) 

since the right side of (10-1-2) is 0 if either n or n + 1 is a nonresidue, 
and 1 when both are quadratic residues. Hence, 

The first sum in (10-1-3) is equal to (p - 2)/4. The second and 
third sums are relatively easy to evaluate. Since there are as many 

quadratic residues as nonresidues modulo p, we see that ^ is 

equally often —1 and+1 in the interval [l,p —1]. Consequently, 




(10-1-4) 


and therefore 


and 


If we could also prove that 

KfXT)- 1 ' 

then we would have the formula 


( P-D/2 


(10-1-5) 


N(p) (p —2 — (—l) (p_1)/2 —1 — 1) 


= 4 (P-4- (-1)(p-i )/2 ). 
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Theorem 10-3 will complete this proof by establishing (10-1-5). ■ 

Since sums similar to (10-1-5) will also play a significant role in 
Theorem 10-4, we shall prove two general, related results. 

Theorem 10-2: Suppose Cj is defined for all integers j , and 
Cj= c k whenever j = k (mod n). Let r u r 2 , . . ., r n he any complete 
residue system modulo n. Then 

Cj = c ri + Cr 2 H- . . . + Cr n . (10-1-6) 

j — o 

Remark: We often abbreviate (10-1-6) by writing 


n — 1 

2 c j = 2 Crj 

j = 0 r(mod n) 

where the second sum means c ri + c r2 -f . . . -f c Tn . 

PROOF: Since both {r l9 r 2 , . . r n } and {0, 1, 2, . . n — 1} 
constitute complete residue systems, each nonnegative integer i less 
than n is congruent to precisely one rj ; thus c t = c r .. Hence, the sums 
c 0 + c x + . . . + c n _! and c ri + c r2 + . . . 4* c Vn are merely commuta¬ 
tions of one another and are therefore equal. ■ 


Theorem 10-3: If p is an odd prime , then 


(n — a)(n — b ) N 
V 


p - 1 
-1 


if a = b (mod p), 
if a 4 b (mod p). 


Remark: (10-1-5) is the case a = 0, b = —1. 


Proof: By Theorem 10-2, 


S ( (n a) p (n — )= £ ( 

n —0 ' ^ ' n(modp) x 


(n — a) (n — b) 

p 


)■ 


Now, as n assumes all values in a complete residue system modulo p, 
so does n + a. Thus 

v> / (« ~ a)(n — b) \ _ y / n(n + a — b) \ 
a(modP) ' P ' ndnodpA P 


If a = b (mod p), then 

^n(n + a — = 1, for n + 0 (mod p), 
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and thus 




If a ^ b (mod p), let k = a — b; then X ^ 0 (mod p). Now (^j = 0, 
if n = 0 (mod p); consequently, we see that 


^ / n(n + X) \ = ^ / n(n + X) \ 

n(modp) \ V t n(modp) \ P / 

0(mod p) 


If n ^ 0 (mod p), there exists an n such that nn = 1 (mod p). Further- 
more, (~~j = 1. Thus, 


y / n(n + \) \ _ y /n 2 \ / n(n H 

nod p) \ P / n(modp) \P / \ P 


X) 


n(mod p) 
n # 0( mod p) 


n(mod p) 
n ^ 0 (mod p) 


nnjnn + Xn) \ 
P ) 


= 2 ( 

n(modp) \ 

0(mod p) 

= 2 

n(modp) \ H / 


n(mod p) 
n# 0(mod p) 


But now, as n assumes all values in a reduced residue system modulo 
p, so does h; since X ^ 0 (mod p), so does Xn. Thus, if we setm= Xn, 
then 

2 2 ( 1 ±“) 

w(modp) \ H / m(modp) \ H / 

n^O(modp) m^O(modp) 

= 2 - (-) 

m(modp) ' P / \P / 

= 0 - 1 (by (10-1-4)) 

= -!• ■ 

We have now also proved Theorem 10-1, since Theorem 10-3 
establishes (10-1-5). With minor alterations, the technique used in 
Theorem 10-1, combined with the general result in Theorem 10-3, 
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yields similar results for consecutive nonresidues modulo p. Such 
results are listed in the following exercises. 


EXERCISES 

1. Let Nj(p) denote the number of pairs of integers in 
[l,p —1] in which the first is a quadratic residue and the 
second is a quadratic nonresidue modulo p. Prove that 

N l (p) = \ (p-(-l) (p - m ). 

2. Let N 2 {p) denote the number of pairs of integers in 
[l,p—1] in which the first is a quadratic nonresidue and 
the second is a quadratic residue modulo p. Prove that 

N t (p) = \ (p-2+ (~iy p - vlt ). 

3. Let N-Ap) denote the number of pairs of integers in 
[l,p — 1] in which the first is a quadratic nonresidue and 
the second is a quadratic nonresidue modulo p. Prove 

that N 3 (P) = \ (p — 2 + (—1) <P_I> ' 2 ). 

4. Suppose Y(/) denotes the number of mutually incongru- 
ent solutions {x,y) of the congruence y 2 =/(x) (mod p) 
(p an odd prime), where/(x) is a polynomial with integral 
coefficients. Prove that 

Note: Two solutions (xi,t/i) and (x 2 ,y 2 ) are sa id to be 
mutually incongruent if either ^ x 2 (mod p) or t/ x + y 2 
(mod p), or both. 

5. Prove that there are p — 1 mutually incongruent pairs of 
solutions of the congruence y 2 = x 2 + 3x + 2 (mod p) 
(p an odd prime). 

6. Suppose g.c.d. (n,p) = 1, where p is a prime and n = q x ai . . . 
qfr y w ith q { prime. LetD (n) denote the number of divisors 
of n that are square-free and are quadratic residues modulo 
p. Prove that 


d( “ ) -5X( 1 + ©)' ,,M)| - 
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7. Using the notation of Exercise 6, prove that 

^, v f 2 r if all qff are quadratic residues modulo p, 
D <">-{ 2 '- otherwise. 

10-2 CONSECUTIVE TRIPLES OF 
QUADRATIC RESIDUES 

An approach similar to that in Theorem 10-1 leads to formulae 
for the number of consecutive triples of quadratic residues modulo p. 
The results in this section are somewhat harder to prove than those in 
Section 10-1; however, we not only obtain our main objective, but 
also unexpectedly discover an important result on numbers represent¬ 
able as a sum of two squares. 

Definition 10-1: Let v{p) denote the number of consecutive 
triples of quadratic residues in the interval [l,p—1]. 

Example 10-1: If p=ll, the quadratic residues less than 11 
are 1, 3, 4, 5, and 9; consequently, y(ll) = 1. As a check on Theorem 

10-1, we note that N (11) =2 — ^ (11 — 4 + 1). 

Theorem 10-4: If p is an odd prime , then 
v{p) = ^p + E rJ , 


where | E p | < ^ Vp + 2. 

Remark: As we shall see in the proof, a much more explicit 
but cumbersome value for E p can be given, especially when p = 3 
(mod 4). 

Proof.* Let C p (n ) be defined by the formula 

I I if n, n+ 1 , and n + 2 are all quadratic residues 
modulo p , 

0 otherwise. 


v ( p ) = 2 Cp( n )- 

n— 1 


Then, 
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As in Theorem 10-1, we see directly that 


C p (n)^ 1+ ~ 1 + 


Consequently, 

1 P-3 

v (p)=s 2 

° n = 1 






n + 1\ , /n + 2\ . (n\(n + 1 


P/\ P 


n\/n + 2\ , /n + l\/n+2 


V) \ V J \ P 
n\ (n+ 1\ /n + 2 


-i£ 1+ s£® + s£f? 


1 *-» /» + 1\ , 1 *U 3 /n + 2 


1 /n\ /« + 1\ , 1 


+ S 2 


8 A \P 


n\ (n + 2 


j- y /-ill 
8 ", \p/ V P 


, 1 V, 3 /n + 1 \ / n + 2 

8 2 v p A P 


I,?, 


n + l\ /n + 2 


Now the first seven of the last eight sums are of the type treated 
in Theorems 10-1 and 10-3. Therefore, 


v(p) = g Up - 3) + (0 


p-2\ _ ( P~l 


°-©-( £ A 


0 \pj \p 


+ -l- 


(p — 2) (p — 1) 


+ - 1 - 


(p-l)(p + l) 


+ -1- 


+ p ^ 3 /n(n + 1) (n + 2) 
n=l \ P 


( 10 - 2 - 1 ) 
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Hence, i£E p = v(p) - ^p, then by (10-2-1) 

O 


F i -.3111111.1 
\ E »\ < 8 + 4 + 4 + 4 + 4 + 4 + 4 + 8 


f n ( n + 1)( n + 2) N 


P-3 / 

X ( 

n= 1 \ 


P 


<2 + l 


1) (n + 2) 


p y / n(n 4-1)' 
n= 1 \ P 

To establish Theorem 10-4, we need only show that 


p -3 / 

2 ( 

n= 1 \ 


n(re + l) (re + 2) 
P 


j ^ 2 Vp. 


Define S (m) by the formula 

S (m) = y f "*"'-” 1 * ). 

H (^dp)V p / 


Then, 


s(D= 2 (■ 

n(modp) \ 


re(re — 1) (re + 1) 


= 2 

n(mod p) 


(n + l)n(n + 2) 
P 


-2( a 

n = 1 V 


(re + 1) (re + 2) 


P 


)■ 


Also, 


^re(re 2 — m) 
P 


) 


( 10 - 2 - 2 ) 


n = 1 V ^ 7 n = (p +1)/2 x 

— S^ 12 / re (re 2 — m) \ (P ^V /2 / (p — re) ((p — re) 2 — m) \ 

n =i \ P / n =i \P / 

_ (p ^ >/2 (re 2 — m) ^ 1^ (p ^ )/2 ^ re (re 2 — rre) ^ 


If p = 3 (mod 4), then 
Therefore, 

| Ep | <2, if p s 3 (mod 4). 

Now, if p = 1 (mod 4), we may only conclude that S (m) is an even 




-1, and so S(m)=0 in this case. 
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number, since 


ft)- 


1 and 




Suppose k is any integer such that k 4 s 0 (mod p). Then 



and 


$( m ) — s 

w(modp) \ r / 


= I 

w(mod p) 


^ ^ n(n 2 — m) 


- 2 ( 

n(modp) N 


kkn(k 2 n 2 — k 2 m) 
V 


) 


k\ y / /cn((/cn) 2 — fc 2 m) 

n(modp) N K 


As n assumes all values in a complete residue system modulo p, so 
does kn. Thus if h = kn , we see that 


S(m) = 


7c\ y / h (h 2 — k 2 rn) 


If j is a quadratic residue modulo p, then j = c 2 (mod p) for some c, 
and 

s ( i),(£)s«-)=(£)su). 

Thus 


| S(l) I = ISO') |. (10-2-3) 

Also, if l and r' are any quadratic nonresidues modulo p, then by 
Corollary 9-1 


l == g2a + l ( moc [ p) ? 
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and 

r' = g 26 + 1 (mod p). 

Hence, if b ^ a, and if we set t = g b ~ a , then r' = It 2 (mod p), and 


s(D = (|)s(fo*) = (|)s(r'). 


Consequently, 


S(i) | = | S(r') ]. 


(10-2-4) 


Hence we see that the absolute value of S (m) is completely deter¬ 
mined by whether m is a quadratic residue modulo p. Consequently, 
with p se 1 (mod 4), 

lE^H S (i) 2 + ^^-s(0 2 = x' s(m) 2 

^ " m= 1 


= X s (™) 2 

m(mod p) 

-2(2 (*i*^ 2 >))( 2 («t»i)) 

m(mod d) d) x " 7 7 x £(mod») N r 7 7 


= 2 2 2 (f)( (,n ‘’p” 1 — ) 

m(modp) s(modp) f(modp) x ^ 7 ' ^ ’ 

= 2 2 (?) 2 ( <m - st) p <m ~ t ’ ) )- 


s(mod p) /(modp) 7 m(modp) x ^ 

Now, applying Theorem 10-3 to the inner sum, we obtain the equation 


(P~ 1) 


^ - S (l). + l£_ii 


«(»■■ 2 2 (f)<p-D 

s(modp) f(modp) 

£ 2 •=s 2 (mod p) 

+ 2 2 (f)(-» 

s(modp) f(modp) N H 7 

f2s£s 2 (mod p) 


= 2 , p -l ) ( p- 1 ,- 2 (f)(o-(t)-(^)) 


= 2(p — l) 2 + 2(p—1) = 2p(p — 1). 
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Hence 

m+m-» 

Thus | S(l) | < 2p 1/2 , and this establishes (10-2-2). 

Corollary 10-1: Every prime p = 1 (mod 4) is representable 
as a sum of two squares. 

PROOF: The assertion follows from (10-2-5). ■ 

This corollary, which seems almost unrelated to Theorem 10-4, 
is an important result in itself and will form the basis of the next 
chapter. 

EXERCISES 

1. Use the results of Theorem 10-4 and Corollary 10-1 to 
construct solutions of x 2 + y 2 = 29. 

2. Using Exercise 4 of Section 10-1, prove that the congruence 

y 2 = x 3 + 3x 2 + 2x (mod p) 

has p mutually incongruent solutions if p = 3 (mod 4) and 
that it has at least p — 2 Vp mutually incongruent solutions 
if p = 1 (mod 4). 

3. Evaluate £ ( »(» +11 (» + 2 > (" + 3) ) for p = 5, 7, 11, 

and 13. How does the absolute value of your result compare 
with 3Vp ? 


(10-2-5) 
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ADDITIVITY 


Part II, Quadratic Congruences, has yielded important 
information about the representation of integers as sums of 
two squares. In Chapter 11, we shall investigate in detail the 
subject of representing integers as sums of squares. This 
problem is part of the broader question: In how many ways 
can a nonnegative integer n be represented as a sum of ele¬ 
ments of a set S? Aside from letting S be the set of perfect 
squares, we might take S to be the set of all positive integers, 
or the set of all odd positive integers, or any set whose ele¬ 
ments are integers. From these possibilities, there arise many 
surprising results which we shall study in Chapters 12, 13, 
and 14. 

Chapter 12, devoted to simpler additive concepts, sug¬ 
gests ways to make conjectures in additive number theory. 
Techniques of generating functions are introduced in Chapter 
13, and are then used in Chapter 14 to prove some of the 
conjectures formed in Chapter 12. 
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CHAPTER 11 


SUMS OF SQUARES 


Recalling the lagniappe we obtained as a corollary to Theorem 
10-4, we shall now find all numbers representable as a sum of two or 
four squares. The problem of representing integers as sums of squares 
dates back to Diophantus of Alexandria, who apparently knew that 
every integer is the sum of two, three or four positive integral squares. 


EXERCISE 

What does “lagniappe” mean, and why does it apply to 
Corollary 10-1? 


11-1 SUMS OF TWO SQUARES 

Theorem 11— Is A positive integer n can be represented as a 
sum of two squares if and only if its factorization into powers of dis¬ 
tinct primes contains no odd powers of primes congruent to 3 
modulo 4 . 

PROOF: Suppose n has the prime factorization 

n = 2 a p 1 /3 W 2 . . . pf r qi yi q 2 y2 . . - <7/*, 

where p t = 1 (mod 4) (1 < i < r), and where q } = 3 (mod 4) (1 ^s). 

Suppose that at least one of the y t is odd, say y 1 . If 

n = x 2 + y 2 

and d = g.c.d.(x,t/), then, with n x = n/d 2 , x 0 = x/d, and y 0 = y/d , we 
can assert that g.c.d. (x 0y y 0 ) = 1 and 


n x = n/d 2 = (x/d) 2 + ( y/d) 2 = x 0 2 + y 0 2 - 
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Now let y be the exponent of q x appearing in the prime factoriza¬ 
tion of n x ; y is odd, because n x = n/d 2 . Since q x divides n 1 = x<?-\- y£, 
q x could not divide either x 0 or y Q without dividing the other; and, 
since g.c.d.(x 0 ,t/ 0 ) = 1, we conclude that g.c.d.(x 0 ,qi) = g.c.d.(t/ 0 ,qi) — 
1. Hence, there exists an integer u such that 


x 0 - Vo u ( mod Qi)» 


by Theorem 5-1. Thus, 


0 s m = x 0 2 + y 0 2 = u 2 y$ + t / 0 2 = y 0 2 (l + tf 2 ) (mod q x ). 


Since q x does not divide t/ 0 2 > we see that 

u 2 4-1 = 0 (mod q x ) . 


Consequently, 


u 2 = —1 (mod q x ), 

which, by Theorem 9-5, is impossible. Hence, all the y x must be even 
if n is to be represented as the sum of two squares. 

Suppose now that all the y t are even, say y t = 28*. Thus, 

n = 2 a pj 3l . . . pf r (qi) 8 ' • • • ( q s 2 ) 8s • 

Now 2 = l 2 + l 2 , and qf = qf + 0 2 .* Furthermore, Corollary 10-1 
asserts that each p t is representable as the sum of two squares. Thus n 
is a product of numbers each of which is a sum of two squares. 

Note that 

(x 2 + y 2 )(v 2 4- w 2 ) = (xv + yw) 2 + (xw — yv) 2 . (11-1 -1) 

Hence, whenever two numbers are representable as a sum of two 
squares, so is their product; moreover, mathematical induction ex¬ 
tends this assertion to products of arbitrarily many factors. Thus n is 
indeed representable as a sum of two squares. I 

Example 11-1; Since 3 = 3 (mod 4), 5 = 1 (mod 4), and 7 = 3 
(mod 4), the factorization of 315 = 3 2 • 5 • 7 contains an odd power of a 
prime congruent to 3 modulo 4. By Theorem 11-1, 315 cannot be 
represented as a sum of two squares. 

*If none of the prime factors of n are congruent to 3 modulo 4, we merely alter our 
argument by omitting the factors q t . 
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Example 11-2: The factorization of 3185 = 5 * 7 2 • 13 contains 
no odd power of a prime congruent to 3 modulo 4. Hence 3185 is 
representable as a sum of two squares. To find two squares that sum 
to 3185, we first represent 5, 7 2 , and 13: 

5 = 2 2 +l 2 , 

7 2 = 7 2 4- 0 2 , 

13 = 3 2 + 2 2 . 


Therefore, 

3185=5 • 7 2 • 13 

= (2 2 + l 2 )(7 2 + 0 2 )(3 2 + 2 2 ) 

= (14 2 + 7 2 )(3 2 + 2 2 ) 

= (14 -3 + 7 -2) 2 + (14 *2-7 -3) 2 
= 56 2 + 7 2 . 

EXERCISES 

1. Represent 274625 (=5 3 * 13 3 ) as the sum of two integral 
squares. 

2. Represent 333 as the sum of two integral squares. 

3. We rec all that if % = x + iy is a complex number, then 
\z \ — Vx 2 + y 2 and z = x — iy. 

(a) Prove that zz = \z\ 2 

(b) Prove that if w = u + iv , then 

| W I | Z I = | WZ I . 

Discuss the relation of this last result to equation (11-1-1). 


11-2 SUMS OF FOUR SQUARES 

The proof that every positive integer can be represented as a sum 
of four nonnegative integral squares requires a preliminary result. 

Theorem 11-2: For each prime p there exist integers A, B, and 
C, not all zero , such that 


A 2 + B 2 + C 2 =0 (mod p). 
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Proof: If p = 2, take A = 1, B = 1, C = 0. If p = 1 (mod 4), 
choose A as a solution of x 2 = —1 (mod p), B = 1, C = 0. If p = 3 
(mod 4), set C = 1. Then we must solve the congruence 


A 2 + B 2 = —1 (mod p). 


Let d be the least positive nonresidue modulo p; then 




= (—1)(—1) =1, and therefore — d is a quadratic residue 


modulo p. Also, d ^ 2, since d is a nonresidue. Choose B so that 
B 2 = —d (mod p). Then we must find an A such that 


A 2 = d — 1 (mod p). 


Note that d — 1 is a quadratic residue modulo p, since it is both posi¬ 
tive and less than d, the least positive nonresidue modulo p. Thus, 
this last congruence is clearly solvable. ■ 

We are now prepared to prove the famous result of Lagrange. 

Theorem 11-3: Every positive integer is a sum of four non¬ 
negative integral squares. 

Proof: Our theorem on two squares relied on equation 
(11-1-1), a rather cumbersome algebraic identity. An equation similar 
to (11-1-1) is also necessary here, namely 

(a 2 + b 2 + c 2 + d 2 ) (e 2 +f 2 -hg 2 + h 2 ) 

= (ae + bf+ eg 4- dh) 2 + ( af— be — ch + dg) 2 
+ (ag + bh — ce — df) 2 + (ah — bg + cf — de) 2 . (11-2-1) 

This equation shows that if each multiplicand of a product is repre¬ 
sentable as a sum of four squares, so is the product. 

To complete our theorem we need only show that every odd 
prime is representable as a sum of four squares. 

By Theorem 10-2, we know that there exist integers A, B, and C, 
not all zero, such that 

A 2 + B 2 + C 2 s 0 (mod p). (11 -2-2) 

We may write (11-2-2) equivalently as 


A 2 + B 2 + C 2 + D 2 = Kp, 


(11-2-3) 
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where K is an integer and D = 0. Take k to be the least positive 
integer such that kp is the sum of four nonnegative integral squares. 
Equation (11-2-3) and the least-integer principle (see Exercise 2 of 
Section 2-1) guarantee that k must exist. 

In (11-2-2) and (11-2-3), we may choose A, B, C, and D in the 

interval To, ^); thus. 


kp < 4 (|) = p 2 ; 

that is, 

k < p. 

To finish the proof, we need only show that k — 1. We therefore 
assume that k > 1, and we derive a contradiction by first considering 
k odd, and second, k even. 

Suppose that k > 1 and k is odd. Then with 


kp = a 2 + b 2 + c 2 + d 2 , (11 -2-4) 

we choose a , fo, c, and d from the interval j^O, so that a = a, b = b, 
c = < 3 , and d = d (mod fc). Hence, 

a 2 +I? 2 + c 2 + d 2 = 0 (mod k), 
and thus there exists l ^ 0 such that 

kl = a 2 + fo 2 + c 2 + d 2 . (11 -2-5) 

Clearly l < k, since each of a, fo, c, and d is less than-|. Now sup- 

pose that l is zero; then a = b = c — d = 0, and therefore a = fo = c = 
d = 0 (mod fc); hence fc 2 1 kp. But this last assertion implies fc | p, 
which is impossible since 1 < k < p; hence 1^0. By (11-2-1), 
(11-2-4), and (11-2-5), we see that 

( kp)(kl) = (a 2 + b 2 + c 2 + d 2 )(a 2 + b 2 + c 2 + d 2 ) 




146 


SUMS OF SQUARES 


Each of the four expressions on the right hand side of (11-2-6) is a 
multiple of k; this is clear for the latter three, since a=a,b = b,c = c, 
and d = d (mod k ). For the first expression, we note that 

ad 4- bb + cc 4- dd = a 2 + b 2 + c 2 + d 2 = kp = 0 (mod k). 

Consequently, there exist integers a, fi, y, and 8 such that (11-2-6) 
may be rewritten as 

k 2 pl = (a/c) 2 -f (/3/c) 2 + (y Zc) 2 + (8k) 2 , 


or 


Ip = a 2 + j6 2 4 y 2 4- 8 2 


(11-2-7) 


where l < k. 

If k is even, we see that either all of a, b, c, and d are even, or 
two are even and two are odd, or all are odd. In any event, we can 
choose a, b, c, and d so that 

a = b (mod 2), c = d (mod 2). 

Hence, 



Therefore, whether k is even or odd, if k > 1, we can find an in¬ 
teger less than k, say k , such that kp is the sum of four nonnegative 

integral squares ( k — l if k is odd, k = -^k if k is even). However, 

since we chose k to be the least positive integer for which kp is the 
sum of four nonnegative integral squares, we have a contradiction. 
Therefore k must equal 1. Thus (11-2-4) holds with k= 1, and our 
proof is complete. ® 

In this chapter we have discussed the circumstances under which 
the equations 

x 2 + y 2 = n 

and 

x 2 + y 2 -f z 2 + ttf 2 = n 


have integral solutions. Theorems 11—1 and 11-3 form a small part 
of the theory of Diophantine equations. 

After treating squares, we may inquire about the solvability of 
Diophantine equations involving cubes or higher powers. In 1770, E. 
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Waring stated that every integer is a sum of at most 9 (positive 
integral) cubes, a sum of at most 19 biquadrates, and so forth. This 
assertion has been interpreted to mean that for each integer m ^ 2, 
there is a corresponding integer N m such that each positive integer n 
is a sum of at most N m positive integral rath powers. D. Hilbert, one 
of the greatest mathematicians of the twentieth century, proved this 
assertion to be correct in 1909. His first proof depended on the use of 
a 25-fold integral. Succeeding mathematicians such as G. H. Hardy, 
J. E. Littlewood, and I. M. Vinogradov have made significant con¬ 
tributions to Waring’s problem by determining sharp estimates for 
the size of N m . 

Perhaps the most well-known Diophantine equation is due to 
P. Fermat: 


+ y n =z n . (11-2-8) 

Fermat noted in the margin of his copy of Diophantus’ works: 

It is impossible to separate a cube into two cubes, or a biquadrate 
into two biquadrates, or in general any power higher than the second 
into powers of like degree; I have discovered a truly remarkable 
proof which this margin is too small to contain. 

However, no proof that (11-2-8) lacks solutions when n > 2 has ever 
been found among Fermat’s papers, nor has anyone been able to 
produce a proof. It has been shown that for 2 < n ^ 25,000, (11-2-8) 
has no solutions; however, this is a long way from a complete proof. 

EXERCISES 

1. Prove (without assuming Corollary 10-1) that, if p is a 
prime = 1 (mod 4), then there exist positive integers ra, 
x , and y such that 

x 2 3 + y 2 = mp , 

with pfx , pt?/, 0 < m < p. [Hint: Use the proof of 
Theorem 11-2.] 

2. Let ra 0 be the least possible ra in Exercise 1. Prove that if 
m oV = *o 2 + !/o 2 , then ra 0 does not divide both x 0 and t/ 0 un¬ 
less ra 0 = 1. 

(In Exercises 3 through 7, assume ra 0 > 1.) 

3. Prove that there exist integers a and b such that, in the 
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notation of Exercise 2, | x 0 — am 0 | — ^ m 0 , I !/o — bm 0 | ^ 
i m 0 , and (x 0 — am 0 ) 2 + ( y 0 ~ bm 0 ) 2 > 0. 

4. In the notation of Exercise 3, let x t = x 0 — am 0 and y t = 
y 0 — bm 0 . Prove that0 < x? + y 2 < m 2 , andx 2 + y t 2 = 
where m, is some integer such that 0 <m, < m 0 . 

5. In the notation of Exercise 4, prove that 

mgm y p = (x 0 2 + Do) (x 2 + y 2 ) 

= (xo*! + I/oI/i) 2 + (*o«/i — x 1 y 0 ) 2 - 

6. Prove that m 0 | (x 0 X! + yoyi) and m 0 | (x 0 yi — Xjyo). 

7. Letting x 2 — (x 0 x, + y 0 yi)hn 0 and y 2 = (x 0 yi - x 1 y 0 )lm 0 , 
use Exercises 5, 6, and 7 to prove that m^p — x 2 2 + y 2 . 

8. Use the results of Exercises 1 through 7 to give a new proof 
that every prime p = 1 (mod 4) is a sum of two squares. 

9. Prove that no integer of the form 4“(8m + 7) is the sum of 
three squares. [Hint: Examine the congruence 

x 2 +y 2 + z 2 = 7 (mod 8).] 
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ELEMENTARY PARTITION THEORY 


12-1 INTRODUCTION 

The theory of partitions is an area of additive number theory, a 
subject concerning the representation of integers as sums of other 
integers. An elementary example of an additive theorem is the basis 
representation theorem (Theorem 1-3). 

Definition 12-1: A partition of a nonnegative integer n is a 
representation of n as a sum of positive integers , called summands or 
parts of the partition. The order of the summands is irrelevant. 

Example 12-1: The partitions of 5 are 5, 4+1, 3 + 2, 3+1 + 1, 
2 + 2 +1, 2 + 1 + 1 +1, and l + l + l + l + l. 

A sum such as 2 + 1 + 2 is considered identical with 2 + 2 + 1, 
since order is irrelevant. Thus, there are seven partitions of 5. Since 
order is irrelevant, we shall henceforth write partitions with non¬ 
increasing order of summands. 

We observe that 0 has one partition, the empty partition, and that 
the empty partition has no parts. 

Definition 12-2: The function p(n) will denote the number 
of partitions of n. 

Example 12-2: p(0) = 1 and p( 5) = 7. 

L. Euler was the first mathematician to discover important 
properties of p(n); in 1748, he presented his results in his book 
Introductio in Analysin Infinitorum. 

You may believe that the techniques of calculus have little rela¬ 
tion to something defined as simply as p (n). If so, a formula found by 
G. H. Hardy and S. Ramanujan in 1917 may surprise you. They ob- 


149 




150 


ELEMENTARY PARTITION THEORY 


tained an exact expression for p(n), the first term of which is 



/2ir / 1\ 

1 d 

eXP W6 V““24j 

27r\^2 dn 

/ 1 


l 

to| 
>M 


As you may easily discern, p(n) grows astronomically. Actual 
enumeration of the 3,972,999,029,388 partitions of 200 would certainly 
take more than a lifetime. However, the first five terms of the re¬ 
markable Hardy-Ramanujan formula give the correct value of p (200). 

To emphasize that p(n) is not as simple as it appears, we note 
Ramanujan's congruence 

p(5n + 4) = 0 (mod 5). (12-1-2) 

The study of congruences such as (12-1-2) has involved some very 
deep properties of elliptic modular functions. 

Partition identities, a subject we shall treat in great detail, is 
best exemplified by a theorem of Euler: 

Theorem 12-1: The number of partitions of an integer n in 
which all parts are odd equals the number of partitions of n in which 
all parts are distinct. 

Example 12-3: There are three partitions of 5 into odd parts: 
5, 34-1 + 1, l + l + l + l + l; and three partitions of 5 into distinct 
parts: 5, 4 + 1, 3 + 2. 

Later in the chapter, we shall prove Euler's theorem and under¬ 
take discovery of new partition identities. First we consider a simple 
geometric device useful in partition theory. 


12-2 GRAPHICAL REPRESENTATION 

Some partition problems are solvable by graphical representation. 
In a graphical representation, a partition is represented by horizontal 
rows of dots. The graphical representations of the partitions of 5 are: 


5 


4 + 1 


3 + 2 


3 + 1 + 1 
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Note that the left-hand dots of each row lie on a vertical line and 
that the dots are equidistant. 

Graphical representation allows us to introduce an important 
transformation of partitions. 

Definition 12-3: The conjugate partition of a given partition 
is formed by reading the number of dots in the successive columns of 
the graphical representation . 

Example 12-4: The graphical representation for 5+ 4-hi, a 
partition of 10, is 


The conjugate partition, then, is 

COL. 1 COL.2 COL.3 COL.4 COL.5 

10=3 + 2 + 2 + 2 + 1 

Example 12-5: The graphical representation for8 + 4 + 3 + l + l, 
a partition of 17, is 


The conjugate partition, then, is 

COL.l COL.2 COL.3 COL.4 COL.5 COL.6 COL.7 COL.8 
17= 5 + 3 + 3 + 2 + 1 + 1 + 1 + 1 

To illustrate the graphical technique, we shall prove the follow¬ 
ing theorem. 


Theorem 12-2: If p m (n) denotes the number of partitions of 
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n in which at most m parts appear , and if 7r m (n) denotes the number 
of partitions of n in which each part is no larger than m y then 

PROOF: Let us consider any partition of n in which at most-ra 
parts appear. The conjugate of such a partition has no parts larger 
than m since there could be at most m dots in any column of the 
graphical representation of the original partition. 

This pairing of each partition of n of at most m parts with its 
conjugate, a partition of n in which no parts are larger than m, estab¬ 
lishes a one-to-one correspondence between the two types of parti¬ 
tions. Hence, there must be the same number of each type; that is, 

Pm(n) = vr m (n). ■ 

Example 12-6: Let n = 5 and m = 3; the five partitions of 5 
with at most 3 parts are 5, 4 + 1,34-2, 3 + 1 + 1, and2+2 + 1, and 
the five partitions of 5 with no part exceeding 3 are 3 + 2, 3 + 1 + 1, 
2+2 + 1, 2 + 1 + 1 +1, and l + l + l + l + l. The pairings are: 

5 *- > l + l + l + l + l 




3 + 2 <-> 2 + 2 + 1 


3 + 1 + 1 «-> 3 + 1 + 1 


-> 


2 + 2 + 1 * 


3 + 2 
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EXERCISES 

1. Form the graphical representation of each of the following 
partitions, and find the conjugate partition in each case. 

(a) 6+4+3+2+1+1 (d) 9 + 8 4- 3 + 1 

(b) 7 + 5 + 1 (e) 8 + 6 + 24-2 + 1 

(c) 10+ 7 + 64-3 + 3 + 3 (f)4 + 3 + 2 + l 

2. List, for the case n = 6, m = 4, the pairings of partitions 
described in the proof of Theorem 12-2. 

3. List, for the case n = 7, m = 3, the pairings of partitions 
described in the proof of Theorem 12-2. 

4. A partition is called self-conjugate if it is identical with its 
conjugate. Thus, the two self-conjugate partitions of 8 are 
4 + 2 + 1 + 1 and 3+3 + 2. The graphical representations 
of these partitions are 


and 


respectively. We may now form two new partitions of 8 
by uniting dots lying on successive right angles as follows: 


This produces 7 + 1 and 5 + 3. Prove that this procedure 
establishes that the number of self-conjugate partitions of 
n always equals the number of partitions of n into distinct 
odd parts. 

5. Let n ) denote the number of partitions of n with dis¬ 

tinct odd parts. Utilize Exercise 4 to prove that p(n) —O (n) 
is the number of partitions of n that are not self-conjugate. 

6. Use Exercise 5 to prove that p(n) = d7(n) (mod2). 
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12-3 EULER'S PARTITION THEOREM 

We shall now establish the theorem of Euler stated in Section 12-1. 

Theorem 12-3: The number of partitions of an integer n in 
which all parts are odd equals the number of partitions of n in which 
all parts are distinct. 

Proof: As in Theorem 12-2, we shall establish a one-to-one 
correspondence between the two types of partitions. 

Let us first consider a partition having only odd parts. Iff denotes 
the number of times i appears as a part, we may write our partition as 


n=l-f 1 + . .. + 1 + 3 + 3 + .. .+3 + 5 + 5 + . ..+5 + . . .+ 


/i times / 3 times / 5 times 

(2M-1)+ (2M-1) + . . .+ (2M—1) 

Um-\ times 

= /i * 1 +/ 3 * 3 +/ 5 • 5 + . . . + /,*-1 * (2M - 1). (12-3-1) 

By the basis representation theorem (Theorem 1-3), we know that 
each f can be represented uniquely as a sum of distinct powers of 2. 
Thus, 

n= (2 a + 2 ft + . . . +2 C )1 4- (2 e + 2 / + . . . +2^)3 + . . . 

+ (2 r + 2* + . . .+2')(2M-1), (12-3-2) 

and so 

n = 2 a + 2 b + . . . + 2 C + 3 * 2 e 4- 3 • 2 f + . . . +3 • 2 9 4- . . . 

+ (2M — l)2 r -h (2M — 1)2 S + . . . + (2M — 1)2*. (12-3-3) 

This last expression is a partition of n into distinct parts, for we know 
by the fundamental theorem of arithmetic that numbers with dif¬ 
ferent powers of 2 in their prime factorization, or with distinct largest 
odd factors, are distinct numbers. 

We must now verify that we have established a one-to-one cor¬ 
respondence between the partitions of n with odd parts and those 
with distinct parts. Suppose we start with a partition of n into distinct 
parts. Then we can write each part as the product of an odd number 
and a power of 2; we thus obtain an expression like (12-3-3). We now 
collect all parts with identical largest odd factors. Then, factoring out 




12-4 SEARCHING FOR PARTITION IDENTITIES 


155 


the various odd factors, we obtain (12-3-2). By merely summing the 
various powers of 2 in (12-3-2), we obtain a partition of n with only 
odd parts as in (12-3-1). Thus, having established our one-to-one 
correspondence, we have proved the theorem. ■ 

Example 12-7: When n = 5, the pairing we have established in 
the proof of Theorem 12-3 is 

5 <■-> 5 

3 + l + l= 2*l+3 *-»3+2 

1+1+1+1+1=5-1 «-» (4 + 1) • 1=4 + 1. 

EXERCISES 

1. For the case n — 9, list the pairings of partitions described 
in the proof of Theorem 12-3. 

2. Prove that the number of partitions of n in which no part is 
repeated more than k — 1 times equals the number of par¬ 
titions of n in which no part is divisible by k. [Hint: Follow 
the proof of Theorem 12-3, using representation of/* to the 
base k rather than to the base 2.] 

12-4 SEARCHING FOR PARTITION IDENTITIES 

In this section we shall attempt to ferret out results similar to 
Euler’s Partition Theorem. Our object here is not proof , but discovery. 

To illustrate how we conduct the search, we forget Theorem 12-3 
and attempt to discover it. 

Definition 12-4: If S is a set of positive integers , let p(S,n) 
denote the number of partitions of n in which each summand is an 
element of S. 

Problem 12-1 (Discovering Euler's Partition Theorem): If 
D l (n) denotes the number of partitions of n into distinct parts, can we 
find a set S 1 such that 


p(S 1? n) = Dj(n) 


for all n? 

Since D 1 (l)= 1, we see that 1 E S x (otherwise p(Sj,l) =0). Since 
D/2) = 1, we see that 2 S! (otherwise p(S x ,2) = 2). Since 1/(3) = 
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Table 12-1: Discovering Elements of S t . 


n 

D,(n) 

p(S lf n) if 
n $ S, 

p(S,,n) if 
n G S, 

Conclusion 

1 

1 

0 

1 

l e Si 

2 

1 

1 

2 

2 $ S, 

3 

2 

1 

2 

3 g s, 

4 

2 

2 

3 

4 $ S, 

5 

3 

2 

3 

5 e s, 

6 

4 

4 

5 

6$ S, 

7 

5 

4 

5 

7 G S, 


2, we see that 3 G Sj (otherwise p(S 1? 3) = 1). Since Di(4) =2, we 
see that 4 $ Sj (otherwise p(S 1? 4) = 3). Since Dj(5) = 3, we see that 
5 G Sj (otherwise p(S u 5) =2). Since D t ( 6) =4, we see that 6 $ S t 
(otherwise p(S u 6) =5). Since D x (7) =5, we see that 7 E Sj (other¬ 
wise p(S 1? 7) = 4). Table 12-1 contains a summary of these observations. 
It is thus a reasonable guess that S x is the set of all odd numbers. 

We shall now use the same approach to “discover” a famous 
theorem of L. J. Rogers (also discovered independently by S. Ramanu¬ 
jan and I. J. Schur). 

Problem 12-2 (Discovering the First Rogers-Ramanujan 
Identity): Let D 2 (n) denote the number of partitions of n in which 
any two summands differ by at least 2. Can we find a set of integers 
S 2 such that 


p(S 2 ,n) = D 2 (n) ? 

Let us form a table, proceeding as before. 


Table 12-2: Discovering Elements of S 2 . 


n 

D 2 (n) 

p(S 2 ,n) if 
n E S 2 

p(S 2 ,n) if 
n ^ S 2 

Conclusion 

1 

1 

1 

0 

1 e s 2 

2 

1 

2 

1 

2 $ s 2 

3 

1 

2 

1 

3 S 2 

4 

2 

2 

1 

4 G S 2 

5 

2 

3 

2 

5 $ s 2 

6 

3 

3 

2 

6 G S 2 

7 

3 

4 

3 

7 $ S 2 

8 

4 

5 

4 

8 $ S 2 

9 

5 

5 

4 

9 G S 2 

10 

6 

7 

6 

10 $ S 2 

11 

7 

7 

6 

11 G S 2 

12 

9 

10 

9 

12 $ S 2 

13 

10 

11 

10 

13 $ S 2 

14 

12 

12 

11 

14 G S 2 

15 

14 

15 

14 

15 $ S 2 

16 

17 

17 

16 

16 G S 2 
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We see that 1, 4, 6, 9, 11, 14, 16 are the first seven elements of S 2 . 
So far, these are all the numbers one unit away from a multiple of 5. 
Thus, we may conjecture that S 2 consists of all positive integers 
congruent to either 1 or 4 modulo 5. We shall see in Chapter 14 that 
this is indeed correct. 

Definition 12-5: D d (n) will denote the number of partitions 
of n in which any two summands differ by at least d. 

We note that the previously defined quantities Dj(n) and D 2 (n) 
represent the special cases of D d (n ) for d = 1 and d = 2. 

To illustrate some of the pitfalls of searching for partition 
identities, we proceed to study D 3 (n). 

Problem 12-3: Can we find a set of integers such that 
p(^ 3 ,n) = D 3 (n) ? 

Again we construct a table. 

Table 12-3: Discovering Elements of . 


p(y 3 ,n) if p{Sf 3 ,n) if 

n D 3 (n ) n €E y 3 n ^ y 3 Conclusion 


1 1 

2 1 

3 1 

4 1 

5 2 

6 2 

7 3 

8 3 

9 4 

10 4 


1 o iey 3 

2 1 2 $ 

2 1 3 $ y 3 

2 1 4 $ 

2 1 5 e 

3 2 6 $ & 3 

3 2 7 ey 3 

4 3 8 $ 

4 3 9 ey 3 

6 5 ? 


Thus we see that 5^ 3 cannot exist, a fact that may be formulated 
in the following theorem. 

Theorem 12-4: There exists no set of integers such that 


p(y 3 ,n) = D 3 (n). 


It may seem that we are stymied in finding a new positive result; 
however, there is a result in which a partition function only slightly 
different from D 3 (n) is considered. 

_ If S 3 denotes the set of integers congruent to 1 or 5 modulo 6, and 
if D 3 (n) denotes the number of partitions of n in which the difference 
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between any two parts is at least 3, and in which no consecutive 
multiples of 3 appear, then p(S 3 ,n) = D 3 (n). 

This result, originally discovered by I. J. Schur, will be proved 
in Chapter 14. Schur’s result is the first indication that, in order to 
obtain valid theorems, we need to introduce new partition functions 
like D 3 (n). 

To prove Schur’s theorem and the theorem of Rogers and 
Ramanujan, we must first study generating functions. 

EXERCISES (In the j th exercise below , attempt to define a set of 
integers W 5 such that E 5 (n) = p (W j9 n) for all n.) 

1. Ei(n) denotes the number of partitions of n in which all 
parts differ by at least 2 and no consecutive even numbers 
appear as summands. 

2. E 2 (n) denotes the number of partitions of n in which all 
parts differ by at least 2 and no consecutive odd numbers 
appear as summands. 

3. E 3 (n) denotes the number of partitions of n in which all 
parts differ by at least 2, no part is less than 3, and no con¬ 
secutive even numbers appear as summands. 

4. E 4 (n) denotes the number of partitions of n in which all 
parts differ by at least 2, no part is less than 2, and no con¬ 
secutive odd numbers appear as summands. 

5. E 5 (n) denotes the number of partitions of n in which no 
part appears more than twice. 

6. E 6 (n) denotes the number of partitions of n in which the 
difference between any odd part and any other part not 
exceeding it (if such exists) is at least 3, and no part is less 
than 2. 

7. E 7 (n) denotes the number of partitions of n in which no 
part appears more than twice; and, if a part m appears 
twice, neither m — 1 nor m + 1 appears at all. 

8. E s (n) denotes the number of partitions of n in which all 
parts differ by at least 6, neither 1 nor 3 appears as a part, 
and, if m and m + 6 appear as parts, then m ^ 0, 1, 3 
(mod 6). 
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9. E 9 (n) denotes the number of partitions of n in which the 
difference between any odd part and any other part not 
exceeding it (if such exists) is at least 1. 

10. Eio(n) denotes the number of partitions of n in which no 
part appears more than thrice with the restriction that, if 
any part m appears 2 or 3 times, then m + 1 appears at 
most once. 

11. E n (n) denotes the number of partitions of n in which no 
part is less than 2, no part appears more than twice, and 
no consecutive integers appear as summands. 

12. E 12 (n) denotes the number of partitions of n into parts 
such that no consecutive integers appear as summands, 
and no part is less than 2. 

13. E 13 (n) denotes the number of partitions of n in which no 
part appears more than once and no multiple of 3 appears. 



CHAPTER 13 


PARTITION GENERATING FUNCTIONS 


In Chapter 12, we used purely elementary procedures to establish 
theorems about various partition functions. We can obtain far more 
interesting results about partitions by using generating functions, 
many of which are infinite products. An infinite product is defined in 
much the same way as an infinite series. 

Definition 13-1: If a 0 , a u a 2 , ... is a sequence of num¬ 

bers , then the nth partial product of the a t is 

n 

11 dj = Clo(liQ'2 • • • d n , 

3 = 0 

and the infinite product of the a t is 

go ti 

P dj = lim dj, 

j=0 )=0 

if that limit exists and is not zero. Otherwise , the product is said to 
diverge. 

We shall first study the relationship between infinite products 
and partition functions. Afterwards, we shall examine some theorems 
relating infinite products and infinite series; these results will be 
valuable to our understanding of partition identities in Chapter 14. 


13-1 INFINITE PRODUCTS AS GENERATING 
FUNCTIONS 

THE GENERATING FUNCTION FOR n m (n) 

Theorem 13-1 illustrates a relationship between products and 
partition functions by exhibiting the generating function for n m (n), 
the number of partitions of n in which no part is greater than m. 

160 



13-1 INFINITE PRODUCTS AS GENERATING FUNCTIONS 161 


qo m ^ 

Theorem 13-1: ^ = Y1 T~Z —j * where | q | < 1. 

PROOF: Recall the formula for the sum of a geometric series, 
— = 1 + x + x 2 + x 3 + x 4 + . . ., where | x | <1. 

1 — X 1 

(We can obtain this by letting n—» 00 in Theorem 1-2.) 

Thus, 

fr 1 = 1 1 1 1 

11 l — qj 1 — q 1 — q 2 1 — q 3 1 — q m 

= (1 + q 1 + q 2 1 + q 3 ' 1 + q 4 1 + . . .) 

• (1 + q 2 + <7 2 - 2 + q 3 * 2 + <7 4 ‘ 2 + . . .) 

• (1 4- q 3 + q 2 ’ 3 4- q 3 ' 3 + q 4 ' 3 + . . .) 

• (1 + q ,2 ‘ m + (/ 3 ' m + (/ 4,wl + . . .). 

Now, since the number of these series is finite and all converge 
absolutely for \q \ < 1, we may multiply them and collect terms. 
Hence, 

m 1 

II Tz r a ] = 1 + q 1 + (v 2 i + q 2 ) 

j=l q 

+ (q 3 ' + q 2 + ' + q 3 ) 

+ ( q 41 + q 2 + 2 1 H- </ 22 + c/ 3 + i + </ 4 ) 

+ . . .. 


Let us examine one of the lines above, say q 4 * 1 + q 2 + 2 ' 1 + q 2 * 2 + q 3 +1 + 
q 4 The exponents are 4*1 (=14-1 + 1 + 1), 2+2*1 (=2 + 1 + 1), 
2 • 2 (=2 + 2), 3 + 1, and 4; these are precisely the partitions of 4. In 
the general case, the expression for each exponent of sum n contains 
7r m (n) terms, one corresponding to each way of writing the exponent 
n as a sum of the form/j • 1 + f 2 • 2 + . . . +/ m • m. This last expression 
is merely the partition of n into parts each not exceeding m, where 1 
appears f x times, 2 appears/ 2 times,. . ., and m appears f m times. Thus, 

m 1 oo 

n l — n j = 2 7r »i( n )Q n * B 

j=l ^ n=0 
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THE GENERATING FUNCTION FOR p{n) 

oo 

To study the generating function p(n)q n for p(n), we need 
the following result. ” = 0 

Theorem 13-2: lim p (n) 1,n = 1. 

n-*°o 


This theorem and the root test of calculus imply that ^ p(n)q n 

« = 0 

converges for | q \ < 1. Rather than disrupt our discussion, we refer 
interested readers to Appendix A for a complete proof. 

00 00 1 

Theorem 13-3: ^ P( n )Q n = Yl yzr j > where \ q\ <1. 

m — o i=i ^ Q 


Proof: Note that if m ^ n, then i r m (n) = p(n), since no part 
of a partition can exceed the number being partitioned. Clearly, 
0 ^ 7r m (n) ^ p{n) for all m and n. Consequently, 


2 v( n )Q n ~fl yz: 


3=1 


2 p(n)q n - 2 ^m(n)q w 


= fl (P(^) - Tr m (n))q n 

n = m+ 1 

£ 2) p(n) | q\ n . 

n = m + 1 


Since Theorem 13-2 shows that the series y p(n)q n converges for 

n = 0 

| q | < 1, we see that 


lim V p(n) | q | M = 0; 

m— 


hence, 



THE GENERATING FUNCTIONS FOR p{S d ,n) 

In exactly the same way, one may establish the formulae: 
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X P(Si,n)g"=n i (13-1-D 

i p(M</*=n 03-1-2) 

| o P(S 3 ,n)q n = ]Q ^ , (13-1-3) 

where S d is the set of all positive integers congruent to 1 or to d+2 
modulo d 4- 3; the series and products converge for | gr | < 1. 

THE GENERATING FUNCTION FOR d m (n) 

By slightly altering the proofs of Theorems 13-1 and 13-3, we 
can find the generating function for partitions with distinct parts. 

Theorem 13-4: If d m {n) denotes the number of partitions ofn 
into distinct parts , none greater than m , then 

°o m 

X d m (n)q n = Y[ (l + </ J ). 

n ~o j=l 

m 

PROOF: Yl (1 + qi) = (1 + q) (1 + q 2 ) (1 + q 3 ) . . . (1 + q») 

j= 1 

= 1 + q q 2 + ( q 2 + 1 + q 3 ) + (q 4 + q 3 + 1 ) 

+ (q 5 + q* + 1 + q 3 + 2 ) 4- . . .. 

In general, the term q n appears as often as n can be expressed as a 
sum of distinct integers, each not exceeding m. Hence, 

m oo 

fl (1 + q } )='2 d m {n) q n . ■ 

3=1 n = 0 


THE GENERATING FUNCTION FOR D,(n) 

We can now obtain the generating function for D,(n) from 
Theorem 13-4, just as we obtained the generating function for p(n) 
from Theorem 13-1. 
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Theorem 13-5: 2 D 1 (n)q n = (1 + q J ), where \ q | <1. 

n = 0 j= 1 

PROOF: As in the proof of Theorem 13-3, we note that 
d m (n) = Dj(n), if m ^ n. Also, 0 ^ d m (n) ^ Dj(n) ^ p{n) for all m 
and n. Consequently, 

oo m 

2 D 1 (n)q n -f[ (1 + Q 1 ) 

n-o 5=1 

= 1 2 (E\(n) - d m (n))q n 

I n=m+ 1 


2 D 1 {n)q n ~2 d m {n)q n 


* 2 E>i(n)l</I n 

n = m + 1 

- 2 p(n)\q\". 

n = m+ 1 


As in the proof of Theorem 13-1, we see that 
lim V p(n)\q\ n = 0 

m ^°° V, ^ _L 1 


for | q | < 1; hence 


oo m °° 

y D 1 (n)q" = lim TT (1 -§- </ j ) = !”[ (l + q J )- 


SOME APPLICATIONS OF GENERATING FUNCTIONS 

We shall now see the utility of generating functions in establish¬ 
ing relations between various partition functions. First we give a 
different proof of Theorem 12-1. 


2 £>>(«)<?"=n 

n = 0 j = 1 

_A (l + q J )(l-<7 J ) 


oo 


=n 

i=i 


(1 - <? 2J ) 
(1 - t7 J ) 
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00 00 1 

-R (1 ~^ 

~jji ^ q ^ (1 - q 2i )(l ~ 

°° 1 

= n (1-^-*) 

= 2 P(Si,n)q*. 

n i. = 0 


Now, as we remarked in Section 3-4, a function has at most one 
MacLaurin series expansion. Consequently, £h(n) = p (S^n) for 
all n. ■ 

This proof of Euler’s Partition Theorem relies on manipulations 
and rearrangements of infinite products. The correctness of such pro¬ 
cedures is established in Appendix B. 

We can now prove a result of I. J. Schur that also relies on the 
manipulation of infinite products. 

Theorem 13-6: If Q 3 (n) denotes the number of partitions of 
n into distinct parts , no part being a multiple of 3, then Q 3 (n) = 
p(S 3 ,n) for all n. 

PROOF: As with D a (n), we can prove that 

2 Qs(n)q n ~f{ (1 + q 3i-2 )(l + q 3j_1 ). 

n = 0 j = 1 


Hence, 

2<? 3 (n)Q"=n (1 + q 3i ~ 2 ) (1 + q 3i_1 ) 


” (1 + <jrW-‘)(l + q 3i ~ 2 ) (1 - q 3i ~ l )(l-q 3i ~ 2 ) 

M (l-qfW-^a-</«-*) 

» (i- q «-»)(i- q «-4) 


= n a-q 6J ~ 2 )(l-q 6j - 4 ) n 

j=l i= 1 


1 
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-n a-q 6 i - 2 )(i-q 6i ~ 4 ) 

j=l 

rr 1 

n (i - (i - c^- 1 ) (i - (? 6j - 5 ) (i - <7 ei - 2 ) 

i °° i 1 

=n (i-<,«-•) (i-g«-ij 

= 2 p(S 3 ,n)q n ; 

n = 0 

thus £) 3 (n) = p(S 3 ,n). ■ 


EXERCISES 

1. Prove directly (that is, without using Theorem 13-1) that 

ir m (n) = 7r m (n — m) + 7r m _ 1 (n), 
where 7r m (n) is defined to be 1 if n = 0 and 0 if n < 0. 

oo 

2. Dedyce from Exercise 1 that if F m (q) = ^ 7 r m (n)q n , then 

F m (q) = q m F m (q)+F m ^(q). 

3. Use Exercise 2 to give a new proof of Theorem 13-1. 

4. Prove directly (that is, not using Theorem 13-4) that 

(I ul {n) ~ d m —i(n) + m), 

where d m (n ) is defined to be 1 if n = 0 and 0 if n < 0. 

oo 

5. Deduce from Exercise 4 that if L m (q) = ^ d m (n ) q n , then 

n = 0 

L m (q) = L m -i(q ) + q m L m ^ 1 (q ). 

6. Use Exercise 5 to give a new proof of Theorem 13-4. 

7. Prove that the number of partitions of n into distinct parts 
congruent to 1 ? 2, or 4 (mod 7) equals the number of parti¬ 
tions of n into parts congruent to 1, 9, or 11 (mod 14). 
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8. Let S be any set of integers such that 2 j E S whenever 
j 6 S. Define S' to be the set of integers j E S for which 

j 

^ $ S. Prove that the number of partitions of an integer n 

into parts taken from S' equals the number of partitions of 
n into distinct parts taken from S. 


13-2 IDENTITIES BETWEEN INFINITE SERIES 
AND PRODUCTS 

Here we shall prove some basic results relating infinite series to 
infinite products. Throughout this section, we shall ignore questions 
of convergence and rearrangement of series; these receive full treat¬ 
ment in Appendix B. 

Theorem 13-7: If \q \ <1, then 


qii(n — l)/2gM 


1+ n ? t (!-</)(!- 9 *) . . . (l-q») Uo il + Zqn) (13 ' 2 ' 1) 


and 


1 + 2 rTT- 


(1 - g)(l - q 2 ) • • • (1 -q”) n (1 - zq*) ’ 
with the stipulation that \z \ < 1 in (13-2-2). 


1 


(13-2-2) 


PROOF: We start with (13-2-1). The infinite product in (13-2-1) 
must have a MacLaurin series expansion in z (see Appendix B for a 
proof). Let 

/(*)-n (1 + z Q n )i (13-2-3) 

n = 0 

and let the MacLaurin series expansion for/(z) be 

/(*) =2 

n~ 0 


where the A n depend on q. Now 

f(z) = [J (1 + zq n ) 

n~{) 


(13-2-4) 
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= (1 + z) (1 + zq n ) 

n = 1 


= (1 + z) (1 + zq n+1 ) 

n-0 

= (1 + 2 ) n ( 1 + Z ^ n ) 

n = 0 


= (1 + z)f(zq). (13-2-5) 

Substituting (13-2-4) into (13-2-5), we obtain the relations 


2 A n z"= (1 + z) 2 A n z n q n 


oo oo 

= *2 A n z n q n + '2 A n z n+1 q n . (13-2-6) 

ri = 0 n = 0 

Now, since A 0 is the constant term of the MacLaurin series, A 0 = 
/(0) = 1. For N > 0, let us compare the coefficient of a general power, 
say z N y on both sides of (13-2-6). On the left side it is A N ; on the right, 
it is A N q N 4- A N - x q N ~ l . Since a function, in this cas ef(z ), has at most 
one MacLaurin series expansion, we find that 


Therefore, 
that is, 


A^ — A N q N 4- A N - t q N l . 
(1 ~~ q N )A n = A N -iq N 

A - qN ~ X - A 

^N— ^ | 1- 


(13-2-7) 


By repeated application of (13-2-7), we can express A# in terms of q, 
as follows: 


A n — ■ 


(1-^) 


An~1 




N-l n N -2 


( 1 -^) (1 - q "- 1 ) N ' 2 


q N ~ x _q^ 2 


(1-q*) (l-^" 1 ) d-^' 2 ) *' 3 


A n - 
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g(AT-l) + (JV-2) + (JV —3) + ... +2 + 1 + 0 ^ 

= (1- q»)(l- q*- 1 ) ... (l-q) Ao ' 

The sum of the first N — 1 positive integers is ( N 2 — N)l 2, by Theorem 
1-1, and A 0 = 1. Thus we obtain the formula 

q(N2-N)l2 

An= (l-</")(l-q"-‘) . . . (1-qV 

Hence, 

n (i+ z( t n ) = /( z ) 

n = 0 

=i 

n = 0 


= 1 + 2. rrr 


Tl) I 2 ^71 


(l-q)(l-q 2 ) . • • (1-<T) ’ 


as we asserted in (13-2-1). 

Since the proof of (13-2-2) mirrors the proof of (13-2-1), we shall 
merely outline the procedure. Let 

g <*> = i =n ( i-L.v < i3 - 2 -*> 

n = 0 n = 0 f 


Then 


(1 -z)g(z) = g(zq). (13-2 -9) 

Consequently, B 0 = 1 and 


(l-q”)B n = B n ^. 


(13-2-10) 


Therefore 


B n = 


(l-q)(l-q 2 ) . . . (l~q n ) 


—^forn^ 1. (13-2-11) 


Substituting (13-2-11) into (13-2-8), we obtain (13-2-2). ■ 


Our next result is known as Jacobi’s Triple Product Identity. 
Theorem 13-8: If z * 0 and | q | <1, then 
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J] (l-<7 2n+2 )(l + z<7 2n+1 )(l + z- , <7 2n+1 )= ^ 4" 2 z n . (13-2-12) 

m = 0 n = -co 

00 

Remark: We can think of the doubly infinite series ^ q n *z n as 

n= —oo 

the sum 

2 <7" 2 z" + 2 

m = 0 n = — 1 

of two infinite series. The last expression may look more familiar if 
in the second series we replace n by —n. Thus, 


2 <7 W V = ]T q tl2 z n +'Z q n2z ~ n • 

n= —oo n = 0 n= 1 

Proof: Assume | z | > | q |. In (13-2-1), replace q by q 2 and 
z by zq. This yields the formula 

q n *z n 


[] (l + zq 2 « +1 ) = l + 2 7 T 3 

n = 0 ' 11 

Now 

_1_ 

(1-c 7 2 )(1-</ 4 ) . . . (1 -q 2n ) 


—03-2-13) 


A (l-fl 2 )(l-fl 4 ) • • • (1 ~q 2n ) 


(1 _ (/ 2m + 2« + 2) 

m = 0 

n (l-<7 2m+2 ) 


Therefore, 


(1 + zq 2n+1 ) = 2 <Jf"V 


(l-< 7 2m+2n+2 ) 

m ~ 0 

f] (l-g 2 - + 2 ) 


Since the product in the denominator does not involve n, we may 
factor it out of the sum and obtain the formula 

H u+z<7 2 " +i )=n n-;^ i q n ** n n a-q 2m+2n+2 )- 03-2-14) 

n = 0 m = 0 V 1 q ' n = 0 m = 0 

Note that if n is a negative integer, then 

h (i-(/ 2 m+ 2 " + 2 )=o, 
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since, for this product, the term with m = — n — 1 yields 
(1 - ^-2«-2 +2n+ 2) = (i - q o) = i - i = 0. 

Hence, if we extend the range of the sum in (13-2-14) from 
0^n<°o to — oo < n < oo, we do not alter the value of the series, 
since we have introduced only terms whose values are zero. Thus, 


n a+«!»■■»)-n (1-^+*) s 

n = 0 rn = 0 V 1 W / n ——» 


(1 - q 2m + 2n + 2 ). (13-2-15) 

w? = 0 

Now, if in (13-2-1) we replace n by m and q by g 2 , and then set 
z = —g 2w+2 , we have the formula 


00 oo /_I \m rt m 2 + 2mn+m 

n . (1 - - 1 + 2 a 1 „ V .. <!-«,“) • , ' 3 - 2 -' 61 

Thus, 

n a+«,*•+■)=ft 

n —0 m = 0 \ x H / 

00 / oo /_i \ m J ~m 2 + 2mn + m \ 

°° 1 
= JJ 0 ( 1 gf2m + 2 ) 

( y «»v + y_ 

,4 d-</ 2 ) • • • (l-o 2 " 1 ) 

i 


J] q in + m)2 zA 

n = — oo / 


JJ„ (i-q 2 »* +2 ) 

( °° 00 ( 1 \ m ~ - m oo \ 

qz ‘ + s, n-W..'i- 


(13-2-17) 


Note that for each integer m, 


2 g<« + ™>V + m = 2 <7 n2 *", 



172 


PARTITION GENERATING FUNCTIONS 


since both n + m and n assume each integral value exactly once. Thus, 
fl (1 + zq 2n + 1 ) — JT[ (1 — n 2m + 2 ) 

n = 0 m = 0 ^ x c / / 

„=?. q " z " l 1 + j? t (l-g 2 )(l-^) ... (l-<7 2m ))• (13 ~ 218) 

Finally, in (13-2-2) we replace q by q 2 and z by —qz~ 1 . This yields the 
formula 


n 


i 


n = 0 (1 + ^Q 2M+1 ) 


l n 2M+l\ = 1 + 2 n - 




! (l-gf 2 )(l-d/ 4 ) . . . (l-t7 2m ) 


, (13-2-19) 


provided that | z | > | q | as we have specified. Substituting (13-2-19) 
into (13-2-18), we find that 

ft (1 + Zq2n + l) = flo (1 - <7 2m+2 ) (1 + z~V M + 1 ) n %S' Zn ' 

Multiplying both sides of this equation by 

00 

jJo (1-</*" + *)(!+ *-V" + I ). 


we obtain (13-2-12) for \z \ > \ q\ and \ q\ <1. 

Now we may repeat the entire argument with z~ l replacing z. 
Since (13-2-12) is symmetric in z and z” 1 , our final result will be the 
same. The accompanying conditions, however, will now be \z~ l \ > 
| q | and z~ l # 0. 

Since | q \ < 1, at least one of the inequalities \z \ > \ q\ and 
| z' 1 | > | q | must hold. Hence, (13-2-12) holds, provided only that 
z ^ 0 and | q \ < 1. ■ 


EXERCISES 

1. Prove that if H(z) = TT . then (l~z)H(z) = 

(1 + az)H(zq). 

2. Using Exercise 1 and supposing that 

h(z) = 2 R « zn ' 

n = 0 

prove that R 0 = 1 and (1 — q n )R n = (1 + ciq n ~ 1 )R n ~ i- 
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3. Using Exercise 2, prove that 

_ (l + a)(l + aq) . . . (l + a9 n ~ l ) 

" (1 - c/)(l - g- 2 ) . . . ( 1 - 9 ") ’ 

and from this deduce that 

1 4. v (! + «)(! + «</) • • • = A (1 + azqf”) 

„^i (1 — qr) (1 ~q 2 ) .. . (1 -q n ) 1=4 (1 - zq n ) ’ 

provided that | z | < 1 and | q | < 1. 

4. Use Exercise 3 to prove the following product-series 
identity: 

1 + V (fr + a)(fr + «q) • • • (h + aQ”" 1 )^ ff (1 + azq") 
A (l-<7)(l-<7 2 )... (l-q n ) , 14(1 ~bzq n )' 

provided that | zb | < 1 and \q \ < 1. 

5. Deduce Theorem 13—7 from the result in Exercise 4. 


In Exercises 6 through 9, you will need Theorem 13-7. Assume 
that | q | < 1 and | a | <1. 

6. Prove that 


y q' (n+ult (l-a)(l-Qq)... (l-aq-») ^ .. m . 

A (l-q)(l-q*)... (l-q») "ll / 1 ° Q > 


a r q rn 


»?o (1-flf) ... (l-<7") r ?o (1-Q) . . . (l-<7 r ) 

7. Use Exercise 6 to prove that 

f a!l”_ +1 - >/2 . a - a) (1 - aq) ■ ■ ■ (1 ■- aq-') _ A ( _ . 

A (1-9)... (1-<7") iJ 0 (1 } 


r ? 0 (1-9) .°. (l-9 r ) XI a + <7 r+s+1 ). 


8. Use Exercise 7 to prove that 
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V <i” ( " +1)/2 (l — a) (1 — off) ■. . (l-aq> w ') -ft n _ aa m\ 

,k (l-q)...(l-Q n ) JA U ^ 

ft (1 + gS) 2, . . . (l-q 2r )' 

9. Use Exercise 8 to prove that 

^ qi w( "+ 1),2 (l — a) (1 — aq) ... (1 — ag” -1 ) _ 

k (i - q) ■ ■ ■ (i - Q n ) 

fj (l- aq 2m + 1 )(l + q m + 1 ). 

m= 0 

10. Let b(n) denote the number of partitions of n into non¬ 
negative powers of 2. (Thus b( 5) = 4 since 5 = 4 + 1 = 
2 + 2 + l = 2 + l + l + l = l + l + l + l + l.) Prove that 

2 fo(n)9” = n (l-flf*")- 1 . 

n = 0 n=0 

11. Using the result in Exercise 10, prove the following three 
identities: 

b(2n + 1) = b(2n) 

b (2 n) = b (2n — 1) + b (n) 

b(n) = 0 (mod 2) for each n > 1. 


I 





CHAPTER 14 


PARTITION IDENTITIES 


14-1 HISTORY AND INTRODUCTION 

The study of partition identities originated with Euler in 1748 in 
his book Introductio in Analysin Infinitorum. Among his important 
contributions are Theorem 12-1 and his Pentagonal Number Theorem, 
proved in Section 14-2. 

Problem 12-2 of Section 12-4 led us to conjecture that the fol¬ 
lowing proposition is true. 

Theorem 14-1: 


D 2 (n) = p(S 2 ,n ), (14-1-1) 

where D 2 (n) denotes the number of partitions of n in which any two 
summands differ by at least 2, and where p(S 2 ,n) denotes the num¬ 
ber of partitions of n into parts congruent to 1 or 4 modulo 5. 

The proof we shall give is the simplest known proof of (14-1-1); 
it is nevertheless troublesome, and its discovery is a tribute to the 
ingenuity of both Rogers and Ramanujan. In 1894, L. J. Rogers proved 
a formula (equation (14-4-1)) that is a disguised form of the equation 


^ D 2 (n)q n =^ p{S 2 ,n)q”. (14-1-2) 

n =o n =0 

Note that this is merely (14-1-1) stated in terms of generating func¬ 
tions. Formula (14-4-1) is buried in the middle of Rogers’s mag¬ 
nificent but long paper, “Second memoir on the expansion of infinite 
products.” Apparently, Rogers’s contemporaries were exhausted after 
reading the first memoir, because this second work, containing his 
proof of (14-4-1), drifted into obscurity. 

In 1916, the famous Indian number theorist, S. Ramanujan, un¬ 
aware of Rogers’s work, discovered the same identity (14-4-1). Unable 
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to prove the formula, Ramanujan referred it to G. H. Hardy at Cam¬ 
bridge, but neither he nor any of his English colleagues could find 
a proof. Thus, in 1916, P. A. MacMahon, in his monumental two- 
volume work, Combinatory Analysis, discusses Ramanujan’s improved 
result. Some time later, Ramanujan was thumbing through an old 
volume of the Proceedings of the London Mathematical Society when 
he suddenly happened onto Rogers’s proof. In the ensuing decade, 
Ramanujan, Rogers, G. N. Watson, and others actively studied this 
identity and similar results. 

We shall give the proof that Rogers and Ramanujan established 
jointly; for convenience, we shall at the same time prove Theorem 
14-2, often referred to as the second Rogers-Ramanujan identity. 

Theorem 14-2: 

D 2 '(») = p(T 2 ,n), (14-1-3) 

where D 2 (n) is the number of partitions of n in which any two 
summands differ by at least 2 and all summands are greater than 1; 
p(T 2 ,n) is the number of partitions of n into parts congruent to 
2 or 3 modulo 5. 

Finally, no discussion of partition identities would be complete 
without mention of I. J. Schur. Schur, in Germany, isolated from 
English mathematicians during World War I, independently dis¬ 
covered (14-1-1) and (14-1-3) in 1917. Perhaps following the pro¬ 
cedure outlined in Section 12-4, Schur studied D 3 (n), and in 1926 he 
proved the following related result. 

Theorem 14-3: 


D 3 (n ) =p(S 3 ,n). (14-1 -4) 

Here , D 3 (n) denotes the number of partitions of n in which the 
difference between parts is at least 3, and in which no consecutive 
multiples of 3 are allowed; p(S 3 ,n) is the number of partitions of n 
into parts congruent to 1 or 5 modulo 6. 

We shall prove (14-1-4) in Section 14-5. 


14-2 EULER'S PENTAGONAL NUMBER THEOREM 

We obtain one of the most interesting corollaries of Jacobi’s 
Triple Product Identity by replacing q by q 312 , and z by —q~ 112 . This 
yields Euler’s Pentagonal Number Theorem: 
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Theorem 14-4: 

2 (— l)«g»<3n- i )/2 _ jj- (l — qr™) ? where \ q \ < 1. 

n = - co n — 1 


REMARK: In Figure 14-1, we see that the number of dots inside 
and on the nth pentagon is n(3n —l)/2. Thus, the numbers 1, 5, 
12, . . ., n(3n — l)/2, . . . are called pentagonal numbers. 

PROOF: By Theorem 13-8, with q replaced by q 312 , and z re¬ 
placed by —qf~ 1/2 , we have the relations 

2 (-1) n q n(3n-DI2 = f[ (1 - q 3n+3 )( 1 - q 3n+1 )( 1 - q 3n + 2) 

n = — oo n = 0 


=n d-9"). 

n = l 

because the three sequences {3n + 3} w = 0 > {3n-bl} w = 0 > and 
{3n + 2} m = 0 contain all the positive integers with no repetitions. ■ 
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The next result is an interesting partition-theoretic interpretation 
of Theorem 14-4. 


Theorem 14-5: Let D e (n) denote the number of partitions of n 
into an even number of distinct parts , and let D°(n) denote the num¬ 
ber of partitions of n into an odd number of distinct parts. Then 


D e (n) — D°(n) 


( -1) 3 ifn = j(3j±l)l2, 

0 otherwise. 


PROOF: We obtain this theorem by comparing coefficients of 
q n in the first and third members of the relation 


00 00 00 

1 + 2 (D e (n) — D°(n))q n =Y\ (1 — Q 171 )^^ (—1) j q j(3j_1)/2 . 

n— 1 m—l j = — oo 

The second part of the two equalities is merely Theorem 14-4. The 
first part we can prove in almost exactly the way we proved Theorem 
13-5: 


1 + 2D i(n)q n = h (1 + Q n )- 

n— l n = l 


The only change is that, when we obtain the coefficient of q n in the 

m 

series expansion of (1 — q j ), each partition of n into distinct parts 

3 = 1 

contributes +1 or —1 to the coefficient, depending on whether the 
number of parts in the partition is even or odd respectively. ■ 


EXERCISES 

1. Prove that J} (— l) n q ,n2 = f[ j-j—| . [Hint: Setz = —1 
in Theorem 13-8.] 

2. Prove that q n2+n = 2f[ t+-++- [Hint: Set z=q 

n= — oo n = 1 vq / 

in Theorem 13-8.] 


3. Prove that q {(1 - - q 2 kn+k+h)^ _ q 2 kn+ 2 k^ = 

n= " 

2 (-DV 


n = 0 
7 kn 2 +hn 
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14-3 THE ROGERS-RAMANUJAN IDENTITIES 

This section contains a five-part proof of Theorems 14-1 and 
14-2. First we establish certain important equations (recurrence 
formulae) for the partition functions under consideration. Then 
we show that these equations fully determine our partition functions. 
After this, we consider the related generating functions, and from the 
effect of the recurrence formulae on the generating functions, we are 
able to deduce the Rogers-Ramanujan identities. 


RECURRENCE FORMULAE FOR THE 8 PARTITIONS 

To begin, we must examine partitions of the type enumerated by 
D 2 (n); let 8i(m,n) denote the number of partitions of n into m 
distinct parts where 1 appears at most i times, and in which any two 
summands differ by at least 2. Our first objective is to verify the two 
recurrence equations 

8 1 (m,n) = 8 0 (m,n) + 8 0 (m — l,n — m ), (14-3-1) 


and 


8 0 (m,n) = 8 1 (m,n — m). (14-3-2) 

To verify (14-3-1), we divide the partitions enumerated by 
8 i(m,n) into two classes; those in the first class contain 1 as a sum¬ 
mand, and those in the second class do not contain 1. The elements 
of the second class are exactly the partitions enumerated by 8 0 (tn,n). 

Let us transform all the partitions in the first class by deleting 
the summand 1 from each, and then subtracting 1 from each of the 
remaining summands. This transformation leaves each partition in the 
first class with one less part, and it reduces the number being parti¬ 
tioned to n — m. Furthermore, since each partition originally contained 
1 as a summand, all of the other parts must have been larger than 2 
(by virtue of the proscription of consecutive integers in the original 
partition). Thus, after our transformation, all parts are larger than 1. 
We have therefore obtained partitions of the type enumerated by 
8 0 (m — l,n — m). 

We can reverse the preceding transformation as follows. Given 
any partition of n — m into m — 1 distinct parts, each larger than 1, 
and with no consecutive integers appearing as summands, we may 
add 1 to every part and insert 1 as a summand to produce the elements 
of the first class. In this way, we have established that there are ex- 
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actly 8 0 (m — l,n — m) elements of the first class. Since the total 
number of elements in both classes equals 8 1 (m,n ), we see that 
(14-3-1) is established. 

To verify (14-3-2), we merely apply the transformation to all 
partitions enumerated by 8 0 (n,m). Then, as above, we establish that 
there are 8 t (m,n — m) partitions being counted. 

Before we proceed toward our goal of proving (14-1-1) and 
(14-1-3), let us clarify the previous reasoning by examining a special 
case. 

Example 14-1: Consider (14-3-1) and (14-3-2) in the case 
m = 3, n = 15. The following tables enumerate all the partitions 
involved, showing the partitions that correspond under the trans¬ 
formations. 

Table 14-1: Illustration of (14-3-1) 

S x (3,15) = 8 0 (3,15) + 5 0 (2,12) 


6 1 (3,15) = 7 8 0 (3,15) = 3 8 0 (2,12)=4 


9 + 4 + 2 

8 + 5 + 2 

7 + 5 + 3 
11 + 3+ 1 
10 + 4 + 1 

9 + 1 

8 + 6+1 


9 + 4 + 2 
8 + 5 + 2 
7 + 5 + 3 


10 + 2 
9 + 3 
8 + 4 
7 + 5 


Table 14-2: Illustration of (14-3-2) 
S 0 (3,15) = M3,12) 


M3,15) — 3 M3,12) =3 


9 + 4+2 
8 + 5 + 2 
7 + 5 + 3 


8 + 3 + 1 
7+4 + 1 
6 + 4 + 2 


In studying (14-3-1) and (14-3-2), we tacitly assumed that 
neither class of partitions is empty. If one class is empty, we may 
easily validate (14-3-1) and (14-3-2) by the following definition: 

I I if m = n = 0, 

0 if either m or n is nonpositive (14-3-3) 
and not both are zero. 


Mm,n) = 8 0 (m,n) = 
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The equation 8i(0,0) = 8 0 (0,0) = 1 accounts for the “empty” partition 
of zero. 


UNIQUENESS OF FUNCTIONS SATISFYING 
RECURRENCE FORMULAE 

It is now important to show that (14-3-1), (14-3-2), and (14-3-3) 
completely define 8 1 (m,n) and 8 0 (rn,n). By “completely define,” 
we mean that if Ci(m,n ) and c Q (m,n) are arbitrary functions defined 
for all integral values of m and n, and 

Ci(m,n) — c 0 (m,n) y4- c 0 {m — l,n — m), (14-3-1)' 

c 0 (m,n) = c x {m,n — m), (14-3-2)' 

f 1 if m = n = 0 

Ci(m,n) = c 0 (m,n) = jO if either m or n is nonpositive (14-3-3)' 
[ and not both are zero, 

then 8 1 (m,n) = c x (m,n) and 8 0 (m,n) = c 0 (w,n). 

This last assertion is proved by mathematical induction applied to 
n. By (14-3-3) and (14-3-3)', 8 1 (m,n) = c 1 (m,n) and 8 0 (m,n) = 
c 0 (m,n ) for all n ^ 0. Assume that 8 1 (m,n) = c 1 (m,n) and 8 0 (m,n) = 
c 0 (m,n) for all n ^ fc, where k > 0. Then, by (14-3-2)', 

c 0 {m,k 4- 1) — c 1 (m,k 4-1 — m). 

If m ^ 0, then by (14-3-3) and (14-3-3)', c x {m,k -f 1 — m) — 
8j (m, fc 4- 1 — m ); if m > 0, then fc 4- 1 — m ^ fc, and Cj (m, k 4-1 — m) = 
8i(m, /c 4- 1 — m) by the induction hypothesis. In any case, 

c 0 (m,/c 4- 1) = c x (m, k 4-1 — m) 

— 8 1 (m,k 4-1 — m) 

= 8 0 (m,fc 4-1), 

by (14-3-2). 

Now 

Ci(ra,/c 4-1) = c 0 (m,fc 4-1) -I- c 0 (m — l,fc 4-1 — m) (by (14-3-1)') 
= 8 0 (m,fc 4-1) 4- 8 0 (m — l,fc 4-1 — m) 

= 8 1 (m,k + 1). 
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Thus we have shown by mathematical induction that 8i(m,n ) = 
Cjlm.n) and S 0 (m,n) = c 0 (m,n), for all integers m and n. 

GENERATING FUNCTIONS FOR THE 8-PARTITION 
FUNCTIONS 

We define 


G 1 (x;q)=^ l 2 &i(M,N)x M q N , (14-3-4) 

M = 0 iV= 0 

and 

G 0 (x;q) = '2 2 8 0 (M,N)x M q N . (14-3-5) 

M= 0 N =0 

The interested reader may refer to Appendix C, where it is shown that 
such series converge for | q \ < 1 and \x \ < \ q\~ 1 - 

We now use (14-3-1), (14-3-2), and (14-3-3) to study G l (x;q ) 
and G 2 (x;q). The definitions (14-3-4) and (14-3-5) lead us to three 
important identities: 


G 1 (x;q)='2 2 8 1 (M,N)x M q N 

m = a N — 0 


= 2 2 (8o(Af,N) + 8 0 (M-l,N-M))x M q N 

M= 0 N=0 

= 2 2 fi 0 (M,N)x M </*+2 J 8 0 (M-l,N-M)x M ^ 

M = 0 A' = 0 M= 0 W=0 

= G 0 (x;q)+xq 2 2 8 0 (M - 1,N - M) (xq) M ^q N ~ M 

M = 0 N= 0 

= G 0 (x;qr) + xqr 2 2 8 0 (M,N)(xq) M q N 

M~ — 1 JV = -A/-1 

= G 0 (x;qO + xqG 0 (x^/;( 7 ). (14-3-6) 


GoU;q)-2 2 8 o (M,N)x m q n 

M= 0 A 7 = 0 


= 2 2 SitM.N-M)*"?" 

M = 0 iV= 0 


Similarly, 
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= 2 2 8 1 (M,N)x M q N+M 

M = 0 N=-M 

= S 2 8 1 (M,N)(xq) M q N 

M= 0A=0 


= Gi(xq;q). (14-3-7) 

Furthermore, (14-3-3) implies that 

G 1 (0;q)=G 0 (0;q)=1. (14-3-8) 

We can now turn the tables: we reverse our steps and deduce 
(14-3-1), (14-3-2), and (14-3-3) from (14-3-6), (14-3-7), and 

(14-3-8). Thus, if we can find any functions 

gi(x;q)='£ J) C!{M,N)x M q N 

N= 0 M = 0 

and 

g 0 (x;q)=’2 j c 0 (M,N)x M q N , 

N = 0 M = 0 

satisfying the conditions 

gi(x;q) = g 0 (x;q) + xqg 0 (xq;q), (14-3-6)' 

go(x;q) =g 1 (xq;q), (14-3-7)' 

g x (0 ;q) = g 0 (0;qr) = 1, (14-3-8)' 

we may deduce that c,(M,N) and c«(M,N) satisfy (14-3-1)', 

(14-3-2)', and (14-3-3)'. Consequently, by what we established 
earlier, c x (M,N) = 8 X (M,N) and c 0 (M,N) = 8 0 (M,N). Hence g x (x;q) = 
G^x^q) and g 0 (x;q) = G 0 (x;q). 


NEW FUNCTIONS 

At this stage, then, we need some “new” functions of x and q 
that satisfy (14-3-6)', (14-3-7)', and (14-3-8)'. The ingenuity of the 
pioneers Rogers and Ramanujan produced the following functions. 
For any integer i, and for | q \ < 1 and | x \ < \ q \ -I , 
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fi(x;q) =2 


(—1 )"a: 2w q f 


n(5n + 3) -in ^ j_ x i+1 g(2n +l)(i+1) 


) 


(l-q)(l-q 2 ) . . . (l~q n ) (1 ~xq j ) 


(14-3-9) 


When n = 0, the summand is by convention just 

(i —x i+i q i+i ) / na-^), 

/ j=i 

since for n — 0, the empty product (1 — q)(l — q 2 ) . . . (1 — q n ) is 
defined as 1. When i — — 1, 

1 - ^+i^2«+i)(i+i) = l _ x o q o = 2 — 2 = 0. 

Thus, each term in the series for f^^X'^q) is zero. Therefore, 

f^(x;q)= 0. (14-3-10) 

Now 


fi(x;q)~fi- i(x;q) 

oo ^ _ ^ n x2nq^ n(5n + 3) ^q—in _ x i+lq(2n+l)(i+l) — in _ g— (i-l)n_j_^.i^(2n+ l)i — (i-l)ra^ 

n=0 (1 - q)(l ~q 2 )... (l-q n ) f[ (1~ *Q J ) 

j = n + 1 


= 2 


” (-l)n x 2 n q2 n ^ n+3) (q~ in (l — q n ) + x 'qf <2 » +1, ‘- <i - 1) "(l — xq n+1 )) 


(l <7)(1 q 2 ) ■ • • (l q n ) J2 CL-xq') 

j = n + l 


(— l)« x 2 n q2 nl5n + 3) q~ in (l — q n ) 


= 2 - 

n -° (1 - < 7)(1 - q 2 ) .. . (1 - q n ) J] (1 -xq 1 ) 

j=W+ 1 


°° (_ l}n x 2nq±n(5n + 3) x iq(2n+l)i-(i-l)n (2 _ xq n + 1 ) 

n=0 (i - q)(i - q 2 ) ■ ■ • (i - q n ) jj (i-£<7 j ) 

j = n + 1 

We note that the first term in the first sum of the last expression 
is simply (1 — q°) / J~J (1 — xq 5 ) = 0. Thus, we may really treat the 

/ j= i 

first sum as running from n = 1 to oo. Replacing n by n + 1 in the first 
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sum, we find that 


oo ^_^) n+l%2n+2q 7 j?(n+ l)(5n + 8)^—in—i 

” =0 (l - cj)(l - q 2 ) ■ ■ • (l - q n ) (l — xq*) 

j=n + 2 

oo ^ rig2nq in(5n + 3) q(2n+l)i—(i-l)n 

” =0 (1 - q , )(l - q 2 ) . . . (1 - q n ) f[ (1 - xq j ) 

j=n + 2 


= x'q 1 ^ 


^ \) n X 2n q 2 w(5n + 3) + in+n^ %2-iq(2n + 2)(2-i) ^ 


n -° (1 - q)(l - q 2 ) . . . (1 - q n ) [] (l~xqq j ) 

j = n + 1 


= *v £ 


(—l) n (xq) 2n q 2 n(5n + 3) -(l-i)njj _ (xq) 2 ~iq(2n+ l)(2-i) 


n ~° (1 - q)(l - q 2 ) . . . (1 - q n ) (1 ~xqq }+1 ) 

j= n+ 1 


= x i q i f 1 - i (xq;q). 

Now, if we set i = 1, (14-3-11) implies that 
fi(x;q) = fo(x;q) + xqf 0 (xq;q). 


(14-3-11) 


(14-3-12) 


If we set i = 0 and recall that (14-3-10) asserts that f_i(x;q) = 0, we 
see that by (14-3-11) 

fo(x;q) =f 1 (xq;q). (14-3-13) 

Finally, if we set x = 0 in (14-3-9), we see that 

fo(0;q) =fi(0;q) = 1. (14-3-14) 


form 


In Appendix C it is shown that fi(x;q) has an expansion of the 


00 03 


f i {x;q) = '2, 2 Ci(M,N)x M q N . (14-3-15) 

iV= 0 M = 0 


Equations (14-3-12), (14-3-13), and (14-3-14) are simply (14-3-6)', 
(14-3-7)', and (14-3-8)' with/replacing g. Thus we may conclude 
that/(x;q) = G t (x;q) and f 0 (x;q) = G 0 (x;q). 
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THE “COUP DE GRACE ” 

Clearly, 

X 8 1 (M,N) = D 2 (N) 

M = 0 

and 

00 

2 8 0 (M,N) — D 2 '(N). 

M= 0 

Thus, 

j]D 2 (N)q N = 2 ^ 

Ar = 0 M=0 N=0 

= G 1 (l;q) 

=fi(l;q) 

2 (-l)»g|» ( 5n+l) (1 _g4n + 2 ) 
— n = 0 

n a-fl j ) 

j=l 


00 00 

^ n^|-w(5n +1) — ^ (—l) n q^n(5n + 9) + 2 

_ n=0 n=0 

n 

j=i 

If in the second sum of the last expression we replace n by —n — 1, 
we find that 


X 


X D 2 (N)q N = 

N= 0 


n = 0 n = — 1 


g-(-»-D(-5n + 4) + 2 


00 


n d-^ j ) 

j=i 


X -CO 

^ ^_1 ) w ^2 w(5n + 1) -j- ^ (—1) n q W (5 W +1> 

n= 0 n=—1 

n a-^) 

j= i 
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(—l) n <72 n<5n + 1) 


(14-3-16) 

j= 1 

By Jacobi’s Triple Product Identity (Theorem 13-8), with q re¬ 
placed by q 512 and z by—q 112 , 

2 (—l) n gf2 w(5n+1) =n (l-c/ 5n + 5 )(l-^ 5n + 3 )(l-c/ 5w+2 ). 

« = -oo n = o (14-3-17) 

Since the five sequences {5n + 5}„ = 0 > {5n+l} n = 0 > {5n + 2}„ = 0 > 
(5n + 3} n “ 0 ? and (5n -f 4} w “ 0 contain all the positive integers with¬ 
out duplications, 

00 00 

Yi (i-^na- *-«xi - >a - ^«)(i - «* + *)<i - ^ +4 ). 

i=i »*o (14-3-18) 

Substituting (14-3-17) and (14-3-18) into (14-3-16), we have the 
formula 


00 


2 D 2 (N)q N 

N=0 


(1 —q 5n+5 )(l —q bn+2 )(l —q 5n+3 ) 

n = o 

f\ (1 - q 5n+5 )( 1 - q 5n+1 )(l - q 5n+2 )( 1 - </ 5n+3 )(l - q 5n+4 ) 

n = o 


| 00 | 1 

n o (i-<7 5n+i )(i-<7 5n+4 ) 



i 


= Y p(S 2 ,N)q N , (14-3-19) 

A'= 0 


by (13-1-2). 

Comparing coefficients on both sides of (14-3-19), we conclude 
that D 2 (N) — p(S 2 ,N), as was asserted in (14-1-1). 

We have already accomplished enough to establish easily 
(14-1-3), the second Rogers-Ramanujan identity. Since it is similarly 
derived from our formulae for fi(x;q), we shall only outline the 
procedure. 
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2 D 2 '(N)q n = G 0 (l;q) 

N = 0 

= fo(hq) 

J? (—l)"< 72 n<5n+3) 

n~ — oo 


n d-^) 

j=i 


H (1 — g ,5n+5 )(l — q 5n+1 )(l — g 5n+4 ) 

n = 0 

f[ (1 - ^ 5n+5 )(l- t/ 5n+1 )(l ~ </ 5n+2 )( 1 - g 5n+3 )(l - g 5n+4 ) 

n = 0 


00 ^ 

= n d- g »»+i)(i-„».+») 

= X P(T 2 ,N)q N . (14-3-20) 

tf =0 

Comparing coefficients on both sides of (14-3-20), we conclude that 
D 2 (N) = p(T 2 ,N) 9 as was asserted in (14-1-3). ■ 


14-4 SERIES AND PRODUCT IDENTITIES 

In this section, we shall give the two identities originally dis¬ 
covered by Rogers. 

Theorem 14-6: If | q | <1, then 

oo git 2 oo 2 

1 + „?! (l-q)(l-q*) ... (1-<7 k ) = JX (l-9 5n+1 )(l-fl 5n+4 ) ’ 

(14-4-1) 


and 

00 q n2 + n 00 1 

1 + n 5 t (l-Q)(l-g 2 ) ... (l-^) = n o (l-^» + 3)(l_ <7 5» + 4)- 

(14-4-2) 

Proof: We start with the substitution of (14-3-13) into 
(14-3-12), which yields 


fi(x;q) =fi(xq;q) + xqf^xq 2 -^). 


( 14 - 4 - 3 ) 
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Let 

fi(x;q)=2 B n x n , (14-4-4) 

n = 0 

where B n depends on q. Substituting this series into (14-4-3), we find 
that 

J B n x n = J B n q n x n + j£ B n q 2n+1 x n + 1 . (14-4-5) 

n = 0 n = 0 n = 0 

Comparing coefficients of x n on both sides of this equation, we find 
that 

B n =B n q n + q 2 ”- l B n - 

since/i(0;q) = 1, we see that B 0 = 1. Therefore, 

Q 2n ~ 1 

Bn = j _ qti Bn-1 

_ q 211 - 1 qzn-s g 

1 — q n 1 — q n ~ x 1 — q 0 

q(2w — 1) + (2n —3) + ... + 3 + 1 

= d -?)•■• (1 ~q n ) 


q n2 

(1 — q) . . . (1 — q n ) 


Hence, 


fi(x;q) = l + f i 

n= 1 


q n2 x n 

(1 ~ q)( 1 -q 2 ) . . . (1 -q n ) * 


(14-4-6) 


Recalling from (14-3-19) that 


oo 2 

= n. (1 — <7 Sn+1 )(1 — q 5n + 4 ) ’ 

we now obtain (14-4-1) by setting x=l in (14-4-6). To prove 
(14-4-2), we note from (14-3-20) that 

00 1 

„0 o (1 — q 5 ™ + 2 )(l — q,5n+3) =fo(l;q) 


=fi(q;q) (by (14-3-13)) 
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“ 1 + 2. 7T3 


7 n2 + r 




by (14-4-6). ■ 


14-5 SCHUR’S THEOREM 

Our object here is to prove the result of I. J. Schur discussed in 
Section 14-1. The proof is easier than the proof of the Rogers- 
Ramanujan identities. We shall begin as in Section 14-3; however, 
we shall never need to pull a rabbit like (14-3-9) out of the hat. First 
we must prove a result, called Abel s Lemma, from the theory of 
infinite series. 

Theorem 14-7: If lim a n = L, then 

n—>°c 


lim (1 — x) 2 a n xn — L- 

x ~* 1 n = 0 

00 

PROOF: The comparison test with the series ^ (|L| + l)x n 

n= 0 

00 

shows that ^ a n xU actually converges for | x | <1. 

« = o 

Recall that the statement “lim a n = L” means that for each e > 0, 

n-» oo 

there exists an N such that | a n — L \ < e/2 whenever n ^ N. Thus, 
(l-x) V a n x n -L = (1 x) 2 a n x n — (1 — x)L J * M 

n -0 n= 0 n =0 

(since 

(1-x) 2 (a n ~L)x n j 

n = 0 I 

(1 - x) 2 (a n -L)x n 


n= 0 


+ 


(l-x) 2 (<*n-L)x* 

n = N+ 1 
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Now let M be the largest of the numbers | a 0 — L |, | a x — L \, \ a 2 — L \, 
. . ., | a N — L |. Then, for 0 < x < 1, 


(1 — x) 2 a n x n -L 


=£ (1 — x)M(N + 1) + (1 




2 1 -x 


< (l-x)M(N + l) +|; 


this last expression is less than e, provided that 
1 _ 2M (N + 1) <x<1 - 


Thus we see that 


00 



With Abel’s Lemma among our tools, we are now prepared to 
prove Schur’s Theorem. 

Theorem 14-8 (I. J. Schur): D 3 (n) = p(S 3 ,n). 

PROOF: Let Aj (m,n) denote the number of partitions of n into 
m distinct parts, each greater than j, such that the difference between 
any two parts is at least 3 and such that no consecutive multiples of 3 
appear as parts. 

We must establish the identities 

A 0 (m,n) — A l (m,n) + A 0 (m — l,n — 3m + 2), ( 14 - 5 - 1 ) 

A!(m,n) = A 2 (m,n) 4- A 1 (m — l,n — 3m + 1 ), ( 14 - 5 - 2 ) 

A 2 (m,n) = A 3 (m,n) + A 3 (m — l,n —3m), ( 14 - 5 - 3 ) 

and 

A 3 (m,n) = A 0 (m,n — 3m). ( 14 - 5 - 4 ) 

Since the proofs of these equations are similar, all resembling the 
proofs of (14-3-1) and (14-3-2), we shall give a detailed proof only 
of (14-5-1). 
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To verify (14-5-1), we divide the partitions enumerated by 
A 0 (m,n) into two classes, those in the first class containing 1 as a 
summand, and those in the second class not containing 1 as a sum¬ 
mand. The elements of the second class are exactly the partitions 
enumerated by A 

Let us transform all the partitions in the first class by deleting the 
summand 1 from each, and then subtracting 3 from each of the re¬ 
maining summands. This transformation leaves each partition in the 
first class with one less part, and it reduces the number being parti¬ 
tioned to n — 3 (m — 1)— 1 = n — 3m + 2. Furthermore, since each 
partition originally contained 1 as a summand, all of the other parts 
had to be larger than 3; otherwise we should have had two parts dif¬ 
fering by less than 3, a proscribed condition. Thus, after our trans¬ 
formation, all parts are at least as large as 1. We have therefore ob¬ 
tained partitions of the type enumerated by A 0 (ra — l,n — 3ra + 2). 

Conversely, given any partition of n — 3m + 2 into m — 1 distinct 
parts such that the difference between any two parts is at least 3 and 
such that no consecutive multiples of 3 appear, we may add 3 to every 
part and insert 1 as a summand to produce the elements of the first 
class. In this way, we establish that there are exactly A 0 (m —1, 
n — 3m+ 2) elements of the first class. Since the total number of 
elements in both classes equals A 0 (m,n), we see that (14-5-1) is 
established. 

The proofs of (14-5-2), (14-5-3), and (14-5-4) are similar to the 
proof of (14-5-1); however, a slight complication arises in (14-5-3). 
Here, in the first class, the partitions do contain a 3, and therefore 
they must have all their other parts as large as 7, since 3 and 6 are 
consecutive multiples of 3. 

In studying the four equations (14-5-1) to (14-5-4), we tacitly 
assumed that both classes of partitions were nonempty. If one class is 
empty, we may easily validate all four formulae by the following 
definition: 


Ao (m,n) = Aj(m,n) = A 2 (m,n) = A 3 (m,n) == 


1 if m— n = 0 

0 if either m or n is nonposi¬ 
tive and not both are zero. 

( 14 - 5 - 5 ) 


The top line of equation (14-5-5) accounts for the “empty” partition 
of zero. 

For i = 0,1,2,3 and | q | < 1, \x \ < \ q |~\ define 


H i (x;q)=^ 

M= 0 


2 A i(M,N)x M q N . 

iV=0 


We use (14-5-1) to (14-5-5) to study H t (x;q). By (14-5-1), 
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00 00 


H 0 (x;q) = y y A 0 (M,N)x M q N 

M = 0 aT=0 

= V V (A X (M,N) + A 0 (M-l,N-3M + 2))x M </ JV 

— y y 

M = 0 N = 0 

+ y y A 0 (M-l,IV-3M + 2)x A V v 

M^O A^O 


00 00 


H,(x;q)+xq ]£ y A 0 (M-l,N-3M+2) 

Af = 0 JV=0 ^ X q3^M-lqN-3M + 2 


= H 1 (x;q)+xq % £ A 0 (M,N) (xq 3 )^" 

Af = — 1 N= — 3M — 1 

= Hj(x;< 7 ) + xqH 0 (xq 3 ;q). ( 14 - 5 - 6 ) 

In exactly the same way, (14-5-2), (14-5-3), and (14-5-4) imply 

that 

Hi(x;q) = H 2 (x;q) + xq 2 H 1 (xq 3 ;q), ( 14 - 5 - 7 ) 

H 2 (x;q) = H 3 (x;q) + xq 3 H s (xq 3 -,q) , ( 14 - 5 - 8 ) 

H s (x;q) = H 0 (xq 3 ;q). ( 14 - 5 - 9 ) 


Substituting (14-5-9) into (14-5-8), we find that 

H 2 (x;q) = H 0 (xq 3 ;q) + xq 3 H 0 (xq 6 ;q). ( 14 - 5 - 10 ) 

Solving (14-5-6) for H l (x;q), we find that 


Hi(x;q) = H 0 (x;q ) - xqH 0 {xq 3 ;q). ( 14 - 5 - 11 ) 

Substituting (14-5-10) and (14-5-11) into (14-5-7), we have the 
equation 

H 0 (x;q) — xqH 0 (xq 3 ;q) = (H 0 (xq 3 ;q) + xq 3 H 0 (xq 6 ;q)) 

+ xq 2 (H 0 (xq 3 ;q) - xq 4 H 0 (xq 6 -,q)). 


Simplifying this last equation, we have the relation 
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H 0 (x;q) = (l + xq + xq 2 )H 0 (xq 3 ;q)+xq 3 (l-xq 3 )H 0 (xq 6 ;q). ( 14 - 5 - 12 ) 
Define the function 

H 0 (x;q) 


h(x;q) =■ 


n ( l-xq 3n ) 


( 14 - 5 - 13 ) 


Dividing both sides of (14-5-12) by |~[ (1 — xq 3n ), we find that 

n~ l 

(1 — x)h(x;q) = (1 + xq + xq 2 )h(xq 3 ;q) + xq 3 h (xq 6 ;q) . (14-5-14) 

Now let us consider h(x;q) in the form 


h{x;q )=2 E n x n , ( 14 - 5 - 15 ) 

n = 0 

where E n depends on q. Substituting (14-5-15) into (14-5-14), we 
obtain the formula 


2 E n x n -J) £n* n + 1 = J En</ 3n % w +2 E n q 3n+1 x n+1 

n= 0 n=0 n=0 »=0 

+ J] E n q 3n+2 x n+1 + E n q en+3 x n+1 . (14-5-16) 

n-0 n=0 

By (14-5-15), (14-5-6), and (14-5-5), h(0;<j) = H 0 (0-,q) = A c (0,0) = 
1; thus £ 0 — 1. Comparing coefficients of x N on both sides of (14-5-16) 
for N > 0, we see that 

E n — E n _ i = q 3N E N + q 3N ~ 2 E N -i + q 3N ~ 1 E N - l + q 3N ~ 3 E N - 1 , 
and thus 


(1 - q 3N )E N = (1 + q 3N ~ 2 + q 3N_1 + 

Hence, 

(l + q 3V-2 )(1 + q 3 W -l ) 

n ^ — q 3N ^n -i 

(l + q 3V- 2 )(l + q 3^- 1 )(l + q 3V-5)(l + q 3^- 4 ) 

(1 — £/ 3JV ) (1 -q 3N ~ 3 ) N ~ 2 


(l + qM-^q + q 8 *- 1 ) 


n 

j = 1 


(l-</ 3j ) 


( 14 - 5 - 17 ) 
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Substituting (14-5-17) into (14-5-15) and comparing the result with 
(14-5-13), we find that 


H 0 (x;q) = (1 - xq 3n ) ^ 

n =0 m=0 


m 


n 

j =i 


(1 + Q 3J ~ 2 ) (1 + Q 3 ^ 1 ) 

(1 ~q sj ) 


(14-5-18) 


Using Abel's Lemma (Theorem 14-7), we shall now deduce Schur's 
Theorem from (14-5-18). 


VD 3 (N)^ = 2 2 A o (M,N)q" 

N= 0 A/ = 0 

= H 0 (l;q) 


00 CO rn /I J_ rj 3i-2W1 

=n (i-^)Hm(i-x) 2 xm n ( Vy> 1 

n =1 m = 0 j =1 V 4 7 


-n<—^>n (i+ ^r ,i-1) 

n = 1 j- 1 v * ' 


=n c 1 +d 3 j_ 2 )(i+d 3j_i ) 

j=i 


= 2 C>3 («)d n 
n = 0 

00 

= Jp(S 3 .»)(|", (14-5-19) 

n = 0 

where the last equation follows from the proof of Theorem 13-6. 

Comparing coefficients of q n on both sides of (14-5-19), we 
obtain Theorem 14-8. ■ 


EXERCISES 


Section 12-4 contains thirteen exercises on the formula¬ 
tion of conjectures concerning identities for various partition 
functions. Many of the possible conjectures can be proved 
using the techniques developed in Chapters 13 and 14. 
Partial results can be obtained for some of the more difficult 
exercises. 

The notation is repeated from the Chapter 12 exercises. 
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1. Let ex{m,n) denote the number of partitions of n into m 
parts in which all parts differ by at least 2 and no consecu¬ 
tive even numbers appear as summands. Let e 3 (m,n) be 
the same, except that all parts must be as large as 3. Prove 
that ei(m,n) — e 3 (m,n) + e 3 (m — 1 ,n — 2m) + e x (m — 1, 
n — 2m+ 1) ande 3 (m,n) = e 1 (m,n —2m). 

00 00 

2. Let g’aW = X 2 e a(^^)x m q n . Deduce from Exercise 1 

n = 0 m = 0 

that ^(x) — sr s (x) + xq^ x (xq 2 ) + xq 2 & z (xq 2 ) and £f 3 (x) = 
&i (xq 2 )- 

3. Deduce from Exercise 2 that, for \ q\ <1, 

v V X n q n *(l + q)(l + q*) . . . (1 + q 2n ~ x ) 

lW 4> (l-q 2 )(l-q 4 ) • • • d-q 2n ) 9 

“ x n q n2 + 2n (l + q)(l + q 3 ) . . . (1 + q 211 ’ 1 ) 

2W 4o (l-q 2 )(l-q 4 ) . . . {l-q 2n ) ' 

4. Use Exercise 3 and your conjectures from Exercises 1 and 

3 of Chapter 12 to conjecture, for W x (l) and ^ 3 (1), series- 
product identities similar to those in Theorem 14-6. 

5. Let e 2 (m,n) denote the number of partitions of n into m 
parts in which all parts differ by at least 2 and no con¬ 
secutive odd numbers appear as summands. Let e 4 (m,n) 
be the same except that all parts must be at least 2. Prove 
that e 2 (m,n) = e 4 (m,n) + e 4 (m — l,n — 2m + 1) and 
e 4 (m J n) = e 2 (m,n —2m) + e 4 (m — l,n — 2m). 

6. Deduce from Exercise 5 that % 2 {x) ==^ 4 (x) 4- xq& 4 (xq 2 ) 
and & 4 (x) =^ 2 (xq 2 ) 4- xq 2 & 4 (xq 2 ). 

7. Deduce from Exercise 6 that, for \ q\ < 1, 

v x*q«* + 1 >(l + q)(l + q*) . . . (1 + q 2n ~') 

* 4(X) n 4 (l-q 2 )(l~q 4 ) . . . (l-q 2n ) 

„ “ x n q n(n+1) (l + q)(l + q 3 )... (l + q?»- 1 )(l + sq 2n+1 ) 

ll .4 (i~q 2 )(i~q 4 ) • • • (l~~q 2n ) 

8. Use Exercise 7 and your conjectures from Exercises 2 and 

4 of Chapter 12 to conjecture, for % 2 {1) and <^ 4 (1), series- 
product identities similar to those of Theorem 14-6. 
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9. Use Exercise 9 of Section 13-2 to prove your conjectures 
in Exercise 8, thus proving also that E 2 (n) = p(W 2 ,n) 
and E 4 (n) = p(W 4 ,n). 

10. Prove your conjecture for Exercise 5 of Chapter 12 using 
the technique employed in Theorem 14-6. 

11. Let e 6 (l,n) denote the number of partitions of n in which 
no part is greater than Z, in which the difference between 
any odd part and any other part that does not exceed that 
part is at least 3, and which does not contain 1. Prove that 

e 6 (Z,n) = e 6 (Z — 1 ,n) 4* e 6 (Z,n — 2Z) 4 e 6 (Z — 2,n — 2Z 4 1). 

12. Now let <£$(x) = 2 f) e 6 (Un)q n x l . Deduce from Exer- 

n = 0 1 = 0 

cise 11 that 


& 6 (x)(l -x) =% > e(xq 2 )(l 4 x 2 c/ 3 ). 

13. Deduce from Exercise 12 that, for | q | <1 and | x | < 1, 

“ (1 4 s 2 qf 4n+3 j 

9%() a~xq 2n ) * 



15. Use Exercises 13 and 14 to prove that E e (n) = p(W 6 ,n). 

16. Let E 12 (n) denote the number of partitions of n in which 
each part occurs at least twice. Use the concept of the 
conjugate partition to prove that E x2 (n) = E 12 (n). 

17. Prove that, for | q \ < 1, 


^ E 12 (n)q n =Y\ (l + q 2n +q 3n + q 4n +...). 

n-0 n=l 

18. Prove that, for | q | <1, 


00 



(1 + q 2n + q 3n + q 4n +...) = ][[ 

n — \ 


(1 — q^ + < 7 2 ") 

(1 ~q n ) 


=n 

n= 1 


(l + q 3w ) 
(1 ~q 2n ) ' 
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19. Use Exercises 16,17, and 18 to prove thatE 12 (n) = p(W 12 ,n). 

20. Let U(n) denote the number of partitions of n in which no 
parts are congruent to 2 modulo 4, the difference between 
any two parts is at least 4, and no consecutive multiples of 
4 appear. Let Y (n) denote the number of partitions of n 
into distinct odd parts. Following the procedure used to 
prove Schur’s Theorem (Theorem 14-8), prove that 
U(n) = Y (n). 




PART IV 


GEOMETRIC 
NUMBER THEORY 


Having studied both multiplicative and additive prob¬ 
lems, we shall now use geometry to obtain information im¬ 
portant to each area. The main relationship between number 
theory and geometry lies in analytic geometry, the association 
between ordered pairs of numbers (x,t/) and points on a 
plane. For example, in the first quadrant, the number of 
lattice points (see Section 2-3 for the definition) lying on the 
hyperbola 


xy = n 


is clearly just d(n ), a function of importance in Part I. 
Similarly, the number of lattice points lying on the circle 

x 2 + y 2 = n 

is r 2 (n), the number of representations of n as a sum of two 
squares, a subject considered in Part III. Both of these geo¬ 
metric sets with their corresponding number-theoretic 
functions will be explored in Chapter 15. 
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CHAPTER 15 


LATTICE POINTS 


As you recall, a lattice point is a point in the xy-plane with in¬ 
tegral coordinates. You have seen the importance of these points in 
the study of the linear Diophantine equation and in our proof of the 
Quadratic Reciprocity Law (Chapter 9). The advanced study of lattice 
points composes an entire branch of number theory, the Geometry of 
Numbers. We shall merely sample this subject by considering two 
elementary problems: 

1. How large is r 2 (n), the number of ways of representing n as a 
sum of two squares? 

2. How large is d(n ), the number of divisors of n? 

We cannot obtain exact answers to these questions, but will have 
to settle for average values of r 2 (n) and d(n); that is, for estimates of 
2 n i N 

tt Y r 2 (n) and -rr V d{n). Our estimates will be fairly accurate. 

™ n = 0 ^ n=l 


15-1 GAUSS’S CIRCLE PROBLEM 

1 N 

The problem of estimating — y r 2 (n) is called Gauss's Circle 

Problem. In Table 15-1, note that we count a 2 + b 2=z n and c 2 + d 2 = n 
as distinct representations of n if either a ^ c or b ^ d, or both. Thus 
r 2 (l) = 4, since 1 = p + 0 2 = 0 2 + l 2 = 0 2 + (-1) 2 = (-l) 2 + 0 2 . 

Apparently, r 2 (n) assumes values irregularly, while the average 

value of r 2 (n) 3.21 in Table 15-1^ behaves more regularly, 

tending to tt as n —> oo. 

= 77 . 


Theorem 15-1: lim — X r 2 (n) 
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1 N 

Table 15-1: Values for r 2 (n) and — ^ r 2 (n) 

^ n = 0 


n 

r 2 (n) 

1 N 

w 2 '.<»> 

iy n = 0 

0 

1 

_ 

1 

4 

5.00 

2 

4 

4.50 

3 

0 

3.00 

4 

4 

3.25 

5 

8 

4.20 

6 

0 

3.50 

7 

0 

3.00 

8 

4 

3.13 

9 

4 

3.22 

10 

8 

3.70 

11 

0 

3.36 

12 

0 

3.08 

13 

8 

3.46 

14 

0 

3.21 

15 

0 

3.00 

16 

4 

3.06 

17 

8 

3.35 

18 

4 

3.39 

19 

0 

3.21 


PROOF: Let ^(N) denote the set of all points in the xy- plane, 
either inside or on the circle whose equation is x 2 + y 2 = N. Since 
r 2 (n) is the number of lattice points on the circle x 2 + y 2 = n , we see 

N 

that ^ r 2 (n) is equal to the number of lattice points in (N). Figure 

n = 0 

15-1 illustrates the case N = 4. 

With each lattice point Q , in or on ^(N), let us associate a unit 
square of which Q is the upper left hand corner (for Q = (1,1), the 
associated square is shaded in Figure 15-2). Let P(N) denote the 
region comprising all these unit squares. The shaded area in Figure 
15-3 is P (4). 

If stf (R) denotes the area of the region R, then stf (P(N)) equals 
the number of lattice points in ^(N), for each lattice point of ^ (N) 
contributes one unit square to the region P(N). In Figure 15-3, we 
see that there are 13 lattice points in ^(4), and thatj/(P(4)) = 13. 
Hence, in general 


jtf(P(AD) =£ r 2 (n). 


( 15 - 1 - 1 ) 







nn i 

n 

1 1 i 

r 2 (n) 

(Number of Lattice Points 
on the Circle x 2 + y 2 = n) 

0 

1 

1 

4 

2 

4 

3 

0 

4 

4 
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Figure 15-3 The region P(4). 


On the other hand, we see that any point on the boundary of 
P(N) must be at most V2 units away from the boundary of ^(N), 
for each unit square has a diagonal of length V2*. Thus, P(N) is con¬ 
tained in the circle centered on the origin of radius Vn 4- V2, and, 
at the same time, it contains the circle centered on the origin of radius 
VN - V2. Hence, comparing the three areas, we find that 

ff(VN-V2) 2 <J*(P(N)) < tt(VN +V2) 2 . (15-1-2) 

Figure 15-4 illustrates this in the case N = 4. Substituting (15-1-1) 
into (15-1-2), we have the inequalities 

v(N - 2 V2VN+ 2) < 2 r,(») < tt(N + 2 V2 Vn + 2); 

N= 0 

that is, 

-tt2V2VN < 2 r*(n) - (N+2)tt < tt2V2Vn. 

Thus, 


1 * 

]v 2 r2(n) 

n = 0 


(N + 2)tt 
N 


< 


7t2 V2 
Vn ’ 


( 15 - 1 - 3 ) 





15-1 GAUSS’S CIRCLE PROBLEM 


205 



Figure 15-4 Circles containing, and contained in, P(4). 


and this implies that 


lim 

N —* 


X r z(^) = J, im 

. N oo 


(N + 2)ir 


By means of some useful notation, we may modify the statement 
of Theorem 15-1. 

Definition 15-1: We say that f(N) = O (g (N)) [read f(N) is 
big “oh” of g(N)] if there exists a constant K such that 

I f(N) | ^Kg(N) 
for all sufficiently large N. 

While there are many important properties of the O-notation, we 
shall need only the following: 

LEMMA: If h (x) = O {f(x )), then 

0(h(x)) + 0(f(x)) = O (/(*)). 

Proof: We have to prove that if gfx) = 0(h(x)) and g 2 (x) = 
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0(/(sc)), then gj(x)+g 2 (x) = 0(f{x)). This assertion is almost 
obvious, since we have that |gi(x) | — Kih(x), | g 2 (^c) I — K 2 f(x), 
and | h(x) | ^ K 3 f(x). Hence 

|gi(*) +g 2 (x) | ^ \ gi(x) | + \g 2 (x) | 

=* KMx) + K 2 f(x) 

^ K.Kjix) + K 2 f(x) 

= (KiK 3 + K 2 )f{x) 

= mx). m 

Having defined the O-notation, we may write (15-1-3) as 
2 r s( n ) — 7T N ~ = O (VN), 

n = 0 

and, since 27 T = 0( Vn"), we may write the relation above as 

2 f*(n) = 7 tN+0(VN). (15-1-4) 

n = 0 

iV 

This equation asserts that 2 r a(») approximately equals tt/V, with an 

n = 0 

error no larger than a constant multiple of VN. 

Gauss’s contribution to the Circle Problem ended with (15-1-4). 
Seventy-odd years elapsed before Sierpinski, in 1906, proved the far 
stronger result 

2 r 2 (n) = ttN + 0(N 1/3 ). (15-1-5) 

n = 0 

Later mathematicians have proved this equation to be correct even 
when 1/3 is replaced by a somewhat smaller number, for example 
27/82. It is known that (15-1-5) is false when the exponent is 1/4, but 
the smallest number that can replace 1/3 has not been determined. 


EXERCISES 

1. Prove that if r 3 (n) denotes the number of representations 
of n as a sum of three squares, then 

V r 3 {n)=±7rN 3 '* + 0(N). 

n = 0 
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[Hint: Count the lattice points in a sphere of radius VN 
centered on the origin.] 

2. Prove that the number of lattice points in the region 

| y | + x 2 ^ N 

is | N 3l2 + 0(N). 

15-2 DIRICHLET’S DIVISOR PROBLEM 

Our object here is to find an average value for the number d (n) of 
divisors of n. 

Theorem 15-2: There exists a constant c such that 
2 d{n) = N logN + cN + 0(VN). 

71 = 0 

PROOF: The function d{n) counts the number of lattice points 
of the form (x 9 y) where x > o ,y> 0, and xy = n. Thus, d (n) counts 
the number of lattice points lying on the hyperbola xy = n in the first 
quadrant. Figure 15-5 illustrates the hyperbolas for n= 1, 2, 3, 4, 5, 

N 

and 6. We see that ^ d(n) is the number of lattice points with posi- 

71= 1 

tive coordinates lying under or on the hyperbola xy = N. 

Let us now carefully examine Figure 15-6. The shaded square 
region clearly contains [ VN] 2 lattice points*; two identically shaped 
regions, Rx above the square and R 2 to the right, contain the remaining 

lattice points. Now, for 1 < n ^ there are ([f] - ['VN]) lattice 

points in Rj lying on the vertical line x = n. Since the total number of 
lattice points in R 2 must equal the number in R 1? we have the equation 

J)d(n)=2 C j| 3 ([~] - [VW])+ [VN] 2 . (15-2-1) 

Now let us define {x} 9 the fractional part of x, by {x} = x — [x]. 
*[VTV] denotes the largest integer not exceeding 
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Figure 15-6 


[VN] 1 _ 

= 22VY -~N + 0(VN). (15-2-2) 

n^l n 

To finish the proof of Theorem 15-2, we must use the integral 
test from calculus (see Appendix D). This test asserts that, if f(t) is 
a positive decreasing function of t such that limf(t) =0, then there 

exists a constant k such that, for each M > 1, 

CM M 

f{t)dt=^ f(n)+k+0(f(M)). 

J 1 n = 1 



(15-2-3) 
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(15-2-4) 


Substituting (15-2-4) into (15-2-2), we obtain the estimate 
d(n)=2N log N — c 0 — O 

n= 1 

= N log N — (2c 0 +l)N + O(VN), 

and, choosing c = — 2c 0 — 1, we have Theorem 15-2. ■ 

As with (15-1-4), the O0/N) in Theorem 15-2 can be replaced 
by 0(N m ) but not by 0(N 1/4 ). 

EXERCISE 

1. Let dz(n) denote the number of ordered triples of integers 
whose product is n. Prove that 

"J) d 3 (n) — N log 2 N + O (N log N). 

n = 1 

[Hint: Count the lattice points with positive coordinates on 
or under the surface defined by xyz = N.] 


-4=Y)-n + o(Vn) 
'Vn'' 








APPENDIX A: 


A PROOF THAT lim p(n) 1,n = 1 

n—>oo • 


Lemma Is The relation p m (n) ^ (n+l) m holds for each in¬ 
teger m > 0. 

PROOF: If, in the expression a x + a 2 + . . . + a m , each a* takes 
all integral values in the interval [0,n], we obtain all partitions of 
n into at most m parts, as well as many partitions of other numbers. 
Since there are (n + l) m possible partitions, the lemma follows. ■ 

Lemma 2: The relation p(n) ^p(n-l)+p ffl (n)+p(n-m) 
holds for each integer m > 0. 

PROOF: Separate the partitions of n into 3 classes: the partitions 
of the first class contain 1 as a summand; those in the second class 
contain no Ts and have at most m parts; and those in the third class 
contain no Ts and have more than m parts. 

Deletion of a 1 from each partition of the first class leaves exactly 
p(n — 1) partitions. The second class clearly contains at most p m (n) 
elements. In the third class, subtract 1 from the smallest m summands 
of each partition; this establishes a one-to-one correspondence be¬ 
tween the elements of the third class and a subset of the partitions of 
n — m. Hence, the third class has at most p(n — m) elements. Con¬ 
sequently, p(n) ^p(n —1) +p m (n) +p(n —ra), as was to be proved. ■ 

Theorem A-l: lim p(n) lln — 1. 

n —> oo 

/ 

PROOF: It is sufficient to show that, for each e > 0, 

p(n) < K(1 + e) n (A-l) 

if n is sufficiently large; this will imply directly that 

1 ^ p(n) 1/n < K lln ( I + e) —» 1 -h € as n —» oo. 

Choose m sufficiently large so that (l + e) m_1 > 2/e. Next, choose 
n 0 so large that, for n > n 0 . 


Pm(n) <| (1+ €)"->; 
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this is possible, since p m (n) ^ (n + l) m by Lemma 1, and since 


lim 

n—»oo 


(n + l) m 
(1 + e) n ~' 


by an m-fold application of L’Hospital’s Rule. 
Now let 


K = 


( max 

\0 <n^n 0 


Pin) \ 

(l + €)»j 


+ 1 . 


Then p(n) < K(1 + e) n for all n - n 0 . 

Assume that p ( n ) < K(1 + e) ” for all n < N, where N > n 0 . Then 


p(N) p(N - 1) + p m (N) + p(N - m) 


<K(l + e) A '- 1 +| (1 + e )^ -1 +'K(1 + e) N ~ m 

<K(1 + ^"-'(l+l + TT+V^) 

< K( 1 + e) v_1 (l+| + |) = K(l + e) A . 


Hence, by mathematical induction, (A-l) holds for all sufficiently 
large n. B 



APPENDIX B: 


INFINITE SERIES AND PRODUCTS 

Convergence and Rearrangement of 
Series and Products 


Recall the ratio test from calculus: The series ^ a„ is absolutely 


convergent if lim 


a n + 1 
dn 


< 1 . 


This test is adequate to establish the convergence of the series in 
(13-2-1), (13-2-2), and all similar series appearing in Chapters 13 
and 14. 


„ in(n-l) „ 

For example, in (13-2-1), £ (1 _ ^J is absolutely 

convergent for | q | < 1, since 


lim 

n -*• 00 


#n+l 

= lim 

n —* 00 

liHn + l) M + 1 

q 2 z n+1 

/ q*’ Hn - 1) z n 

a„ 

(1 -q) ■ • • 

(1 — q n+1 ) t 

(1-q) ...(l~q n ) 


= lim 

n^°° 

n n 7 

(1 ~q n+1 ) 

O* 

o' 

II 

< 1. 


Similar results hold for the other series. 

We now exhibit several relationships between infinite series and 
infinite products; they will establish the convergence of products in 
Chapters 13 and 14. 


Theorem B-l: If a n ^ 0, then (1 + a n ) and ^ a n are both 

n=1 n=l 

convergent or both divergent. 

Proof: Since the function g(%) = e x — x — 1 has an absolute 
minimum at x = 0 and since g (0) =0, we see that 1 4- x ^ e x for all x. 
Hence, 

1 + a 1 + a 2 +.. . + a N ^f\ (1 + a n ) <z e ai + a% + - + aN . (B-l) 

n= 1 
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Consequently, if either the sequence of partial sums or the sequence 
of partial products converges, then the other is bounded and so must 

oo 

converge because both are nondecreasing. Finally, n (1 + a n ) * 0 

n= 1 

since each partial product is at least 1. ■ 


Theorem B-2: If 1 > a n so, then n (1 — a n ) and ^ a n are 

n =1 n=l 

both convergent or both divergent . 

00 

PROOF: Suppose ^ a„ converges; then there exists an N such 

n— 1 

00 2 

that ^ a n < p. Now 

(1 a^) (1 a^+i) = 1 a# #jv+i H~ a^a^ + i 

— 1 — a N — a N +! 

and 

(1 a^) (1 a^+i ) (1 a^+ 2 ) — (1 a^ ^iv+i) (1 #^+ 2 ) 

— 1 fl iV ^iV+1 #iV+ 2* 


We may proceed by mathematical induction to show that, for m^N, 


(1 (1 #jv+i) • . • (1 #m) — 1 a N a^+i . . . a m . 

m 

Consequently, if p„, = ]~J (1 — a m ), then for m s N 

n= 1 


J 9 — (1 A#) (1 ^iv+i) • • • (1 “ a m ) 

Pn- 1 


> 1 


#JV • • • #m 



1 

2* 


Now p m > 0 for all m, since p m is a product of positive numbers. 
Furthermore, 

Pm Pm+1 Pm( 1 (1 1) ) ^m+lPm ~ b; 

thus the p m form a decreasing sequence. Also, p m > | > 0. Hence, 

the sequence is bounded below by a positive number. Since every 
decreasing sequence bounded below by a positive number tends to a 
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positive limit, we see that lim p m exists and is not zero; that is, 

m —>oo 
oo 

Y\ (1 — a n ) converges. 

n= l 

00 

Alternately, begin with the assumption that (1 — a n ) con- 

n = 1 

verges. Then, since 1 — x ^ e~ x for all x (this is the inequality used in 
Theorem B-l, except that — x replaces x), we have the inequalities 

0 < c < (1 - aJU - a 2 ) . . . (1 - a m ) ^ e ’~ ai ~ a2 ~'~ a ^ 


where c < 1. Thus, 


log c < —a x —a 2 — . . . —a m ; 
that is, 


di + a 2 + . . . + a m < —log c. 


{ m oo 

2 a n \ forms a bounded increasing sequence, and so 

n=i Jm=l 

00 

2 a„ converges. ■ 

n~ 1 

The next theorem justifies the manipulation of the products in 
Chapters 13 and 14. First we define absolute convergence. 


00 

Definition B-l: The infinite product (l + a n ) is said to 

n=l 

00 

converge absolutely if (1 + | a n |) converges. 

n~ 1 

00 

Theorem B-3: If | a n | <1, and J^[ (1 + a n ) converges abso- 

n— 1 

00 

lately , then (1 4- a n ) converges. 

n~ 1 

N N 

Proof: Let P n = Y\ (1 + | a n |) and Pn^ (l + a w ).Then 

n— 1 n=l 


Pn — Pn-i I — I (1 + ^i) (1 + # 2 ) • • . (1 + 

— (1 + I I ) (1 + | I ) ... (1 + | &N-1 | ) |«iV 

= P N ~ P^v-i • 
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Consequently, if R > S, 

\Pr-Ps I = \Pr-Pr-i + Pr-i~Pr-2 + • • •+Ps+2-Ps+i + Ps+i-Ps I 
~ I Pft ~~ Pr -1 I + I Pr ~1 ~~ Pr -2 | + • • • + | Ps +2 ~ Ps+1 I 
+ I Ps+1 — Ps I 

— Pr~ Pr-1~^~ Pr~1~ Pr-2^~ • • • + Ps+2 ~~Ps +1 + Ps+1 Ps 
= Pr~Ps • 


Now we know that lim P N exists, so that {Pjv}jv=i is a Cauchy sequence. 

N-* 00 

(Recall from calculus that {a n } n =! is a Cauchy sequence, provided 
for each € > 0, there exists an M such that | ol r — | < e whenever 

R ^ S ^ M. Recall also that a sequence converges to a limit if and only 
if it is a Cauchy sequence.) Since {P n }n=i is a Cauchy sequence, the 
inequality \p R — p s \ ^P R — P S implies that {p^}v=i is also a Cauchy 
sequence. Consequently, lim p N exists. 

We shall now show that lim p N 0. Clearly, 

I Ps \ — I (1 + 0i) (1 + di) • ■ ■ (1 + a N ) | 

- (1 - | o, |)(1 - \a 2 |) . . . (1 - |ojv|) > c >0 

00 

since, by Theorem B-2, Y[ (1 — I a n I ) is convergent. Hence 

n—1 

| lim p N \ > c. ■ 

2 V —> 00 


We now apply Theorem B-3 to some of the infinite products 
appearing in Chapters 13 and 14. 


A product such as Y\ (1 + %q 5n+4 ) is absolutely convergent for 

n= 1 

| q | < 1 since 


00 


2 \zq 5n+4 

n= 1 


k 4 

1 -k 5 


00 2 

A product such as Y\ yzi - n i s absolutely convergent for | q | < 1 

n =]1 -i 

and | % | < | q~ l |, since 


00 



r^-jSU 1 


+ 


1 -zq n )' 
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and since Y ——- is absolutely convergent by comparison with 

,1^, 1 - 

1 oo 

1—r—r y | zq | 

1 - I zq I n=1 

00 CO 

If a„ is real, | a n | < 1, and ^ \a n \ converges, then (1 + a n ) = 


( oo \ oo 

2 log(l + a n )l. Now, 2 log(l + a„) converges absolutely, 

n= 1 ' n=1 

00 

since ^ I I converges. We can therefore justify rearranging the 

tt = 1 

terms of an absolutely convergent infinite product by the argument 
that rearrangement of an absolutely convergent infinite series does 
not alter the value of the sum. 


Maclaurin Series Expansion of Infinite Products 


In order to justify the use of Maclaurin series expansions for 
infinite products in Theorem 13-7 or in Theorem 14-3, we could in¬ 
troduce the subject of uniformly convergent infinite products. How¬ 
ever, there is a simpler procedure. Let us execute, without special 
assumptions, a “reverse proof” of (13-2-1). 


Trri 


~2 n(n—l ) zn 


A ... (1 -q n ) 

first term is equal to 1. Then 

i . .,. 


—, where, by convention, the 


2 ny,l ~ T± ' ~n oo 2 n(n + i) n+1 

(1 + z)F(zq) = i # (1 _ q) . . . (1 _ q n) + 2 0 (1-q) . . . (l-qr») 
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Thus, 


q* Mu+U Z n » q 4 n(n_1) z n 

»4 (1 - q ) ... (1 - 9 ") 4 (1 - q ) • • • (1 - t ?"- 1 ) 

00 a \ nin ~ 1) 7 n / Cl n \ 

= 1 + „?: (1(l-g- 1 ) (l^ + L ) 


„'!»(» —1) „ 
q z Z 


1 + „? 1 (1 - q) . . . (1 - q n ) 
= F(z). 


F(z) = (1 + z)F(zq) 

= (1 + z)(1 + zq)F(zq 2 ) 

= (l + z)(l + z<|) . . . (1 4- zq N )F(zq N+1 ). 


This series for F(z) converges uniformly for | z|< M and | q | < 1, 
by the ratio test; therefore, F(z) is continuous at z = 0. Hence, since 
F(0) = 1, 


F(z) = lim 

N-ycc 


F(z) 

F(zq N+1 ) 



=n (i+* 9 “)- 

n= X 

As you can see, we’ve made no assumption about uniform con¬ 
vergence of infinite products; all we require is uniform convergence 
of infinite series. 



APPENDIX C: 


DOUBLE SERIES 


Chapters 13 and 14 contain expressions of the form 

00 CO 

n = 0 m = 0 

in the proof of Theorem 13-8, we use the rearrangement 


^ ^m,n ^ ^ 

n = 0 m = 0 m = 0 n = 0 

which is permissible only under certain conditions. 

Our use of double series is justified by the theorems to follow. 

00 

In treating ordinary infinite series, we define ^ a n by the formula 

n= 0 


2 “» 
n = 0 


lim 

oo 



There are, however, many possible definitions for the sum of a double 

00 00 

series, ^ 2 a ^,n- We ma Y consider summation by rows or by 

m = 0 n = 0 

columns; for example 


lim 

M -*oo 


2 2 or Jl™ 2 2 


n = 0 m = o 


In the first case, if the limit is finite, we say that the double series is 
convergent by rows; in the second case, that it is convergent by 
columns. 


Theorem C-l: If a m , n - 0 for all m ^ 0 and all n ^ 0, then 


lim 

M ->« 


2 2 a m-n = Jim 2 2 


n = 0 m = 0 


(if either limit is +°°, so is the other). 
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PROOF: Suppose that 



< OO. 


oo 

Let A m = ^ a m,n ; clearly this series converges for each m. Thus 


rt = 0 


^1 A m S. 

rn = 0 

Now, a m>n ^ A m ; hence, by the comparison test, 

2 °m.» ~ S- 

m = 0 


Let B n @m,n \ then 

m = 0 




n = 0 rn = 0 


X °m,n A m = S - 

n = 0 m = 0 


Consequently, lim V B n exists; let us call this limit S'. Clearly, 
n = 0 

S' S s. 

Now we may reverse the argument, and start with 


lim V V flm.n = S'. 


n = 0 m = 0 


We can then deduce that S ^ S'. Hence S = S', and our theorem is 
established. I 


COROLLARY: If 0 ^ fc m , n ^ a m , w , and if^^a m ,n is convergent 

by either rows or columns , then ^^b m , n is convergent by both rows 
and columns, the two resulting sums being identical. 

M co N oo 

Proof: 2 2 and 2 ^ b m , n are both increasing se- 

rn = 0 n = 0 n = 0 rn = 0 

quences bounded above by ^^a m , n ; consequently each converges. 

Theorem C-l implies that the sums are identical, and this allows us to 
make the following convention. ■ 
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Definition C-l: In the light of Theorem C-l, if a m , n s 0, and 
XX»- n converges by either rows or columns , we simply say that it 

converges. 

Definition C-2: If converges , we shall say that 

2 2 a m,n converges absolutely. 

Theorem C-2: If^^a m , n converges absolutely , then 

M oo N oo 

X X “J™ X X «*»•» 

m = 0 n = 0 n = 0 m = 0 

and hot/i limits are finite. 

Proof.* We define 


__ f &m,n if Qm,n ~ 

am ’ n 10 otherwise; 

a __ f dm,n if d m ,n — 0, 

Pm ’ n to otherwise. 

Clearly, by comparison with ^ 2 I I ’ we see *bat 2 2 am w an( ^ 
X 2^ m ’ n conver g e (by both rows and columns). Hence, ^ ^ (a m ,„ — 
Pm,n ) = 2 2 am > n conver g es by both rows and columns. ■ 

As in the corollary to Theorem C-l, if | b m , n | ^ a m , n , and if 
XX«~ converges, then ^ ^ b m , n converges. 

We are now prepared to establish a result that justifies the 
double series expansions in Chapter 14. 

Theorem C-3: Suppose that 

00 

A m {q) = 2 (where a m , n s 0) 

n = 0 

converges absolutely for \q \ <1. Suppose also that a m>n = 0 for 

m > n, and that a mjm+n ^ Kc n , where K is an absolute positive con- 

00 

stant and ^ c nZ n converges absolutely for \z \ <1. Then 
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X X Q >n,nX m q n 

m=0 n=0 

converges absolutely for \ q | < 1 and | x | < | q | _1 . 
Proof: 


S (I \a m , n x m q»\) = ± (i | a m>n x m q n |) 

= 2 (i \a m , n+ mx m q n+m \) 

m = 0 \n = 0 ' 


-2 2 KCnl^l" 1 k 

m = 0 n=0 


K 

1-1*9 


2 c » 191”, 

w = 0 


where the convergence of the last series requires that | q \ < 1 and 

1*1 <\q\~ 1 - ■ 

Let us now apply Theorem C-3 to the expansions (14-3-4) and 
(14-3-5). 

The partition function 8i(m,n) takes the value 0 if m > n, since a 
partition of n into m positive parts implies that n ^ m; furthermore, 


8i(m,n-bm) ^ 8i(m,n + m) 

= 8i(m,n) 4- 8 0 (m — l,n) 
^ 2p(n). 


(by (14-3-1) and 

(14-3-2)) 


Hence, Theorem C-3 applies to (14-3-4) and (14-3-5) with K = 2 and 
c n = p(n). 

A similar argument establishes the expansion used in the proof 
of Schur’s theorem: 


H i (x;q) = '£ £ ±i(M,N)x M q N . 

M= 0 N = 0 

Here, A f (M,N) ^ 8 t (M,N) 9 and the comparison test applies directly. 

Finally, we show how to establish (14-3-15). First, applying the 
ratio test to (14-3-9), we find that 


lim 

n—» oo 


X 2 q 5n + 4 ~ i (l — £*+lg<2n + 3)U+l)) 

(1 - </ n+1 ) (1 - x i+1 q liH+1M+1) ) (1 - xff 


= 0, 
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since | q | < 1. Thus, if | q | < 1 (and | x \ < \ q | _1 , so that the de¬ 
nominator of the series for fi(x;q) is always defined), then the series 
for fi(x;q) converges. In fact, it converges uniformly for \ q\ ^ 1 — € 
and | x | ^ | q | _1 — e; therefore, fi(x;q) is an analytic function of x for 
| q | < I and \ x\ < \ q I' 1 . We can now expand fi(x;q) for i — 1 and 0 
(as in Section 14-4) using only (14-3-13) and (14-3-12). Thus, as in 
Section 14-4, 


q m2 x m 


fl{x;q) J 0 a- g)... u-<rr 

00 m 

fo{x;q] = m ?» au-<rr 


Hence, the coefficient Ci(m,n ) of x m q n in (14-3-15) is 7 T m (n — m 2 ) 
when i~ 1 and 7r m (n — m 2 — m) when i = 0. In either case Ci(m,n) ^ 
p (n); also Ci(m,n) = 0 if n < m. 





APPENDIX D: 


THE INTEGRAL TEST 


Theorem D-l: If f(t) is a positive decreasing function of t 
such that lim f(t) =0, then there exists a constant k such that, for 

t~> 00 

each M > 1, 

CM M 

f(t)dt=± f(n) + k+0(f{M)). 

J 1 n-1 

Proof: Let N = [M]. Then 

[ N f(t)dt-j?f(n)=]? (f f(t)dt-f(n))-f( 1) 

J 1 n = l w = 2 \Jn-l l 

= t a n -f{ 1), 

n = 2 

where 

fl„=f n (/(t)-/(n))d*s0. 

J n — 1 

Now 

os 2 (/(»-D-/(»)) =/(R)-/(S). 

n=«+1 n=R+l 

Thus 2 is a bounded series of nonnegative terms, and there- 

n — 2 

fore it converges. Furthermore, the string of relations in the preceding 

00 

paragraph shows that ^ a n = 0(/(M)). Thus 

n = M+l 

r m m / <*> \ 

f(t)dt= Y /(n)+ 2 +0(/(Af)) 

J 1 n — 1 \n = 2 ' 

M 

= 2 /(n) + fc + 0(/(M)). ■ 

n = 1 
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NOTES 


(See Bibliography for full details on all references) 


Chapter 1 

The proof of the basis representation theorem (Theorem 1-3) 
is by Andrews (1969). 


Chapter 2 

Much of this material is historically developed in Dickson 
(1952b), Chapter 2. 


Chapter 3 

Riordan (1958) presents an extensive introduction to combina¬ 
torial analysis, including many applications of generating functions. 

The proof of Fermat’s little theorem (Theorem 3-4) was originally 
given by Golomb (1956). 

Carmichael (1959), Part I, Chapter 4, gives our proof of Wilson’s 
theorem. 

A thorough account of the relation between elementary number 
theory and computing is given by Knuth (1969), Chapter 4. 

The history of the material in this chapter may be found in 
Dickson (1952a), Chapter 3. 


Chapter 4 

The applications of congruences to card shuffling are discussed 
by Uspensky and Heaslet (1939) and Johnson (1956). Dickson (1952b), 
Chapter 2, contains a historical development of congruences. 
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NOTES 


Chapter 5 

See the notes on Chapter 3 for references to the theorems of 
Fermat and Wilson. 

The background of the Chinese Remainder Theorem is given in 
Dickson (1952b), Chapter 2. 

Lagrange’s theorem is discussed in Dickson (1952a), Chapter 8. 


Chapter 6 

Ths history of the arithmetical functions d(n ), cr(n), and 

fji(n) is found in Chapters 5, 2, 2, and 19, respectively, of Dickson 
(1952a). 


Chapter 7 

The theory of primitive roots is developed historically in Dickson 
(1952a), Chapter 7. 


Chapter 8 

The proof that lim it (*)/* = 0 (Theorem 8-4) simplifies aversion 

x —* 00 

by Mamangakis (1962). 

Dickson (1952b), Chapter 18, gives the history of primes prior to 
1920; more recent developments are discussed by Grosswald (1966), 
Chapters 7 and 8. 


Chapter 9 

Our proof of the Quadratic Reciprocity Law was originally dis¬ 
covered by D. H. Lehmer (1957). Quadratic residues are discussed 
by Dickson (1952a), Chapter 8. 


Chapter 10 

Sums of the form 2 / n(n 2 ^j U , ca j led j aco bsthal sums, are 

n(modp) \ P ' 

mentioned by Dickson (1952b), Chapter 6, page 253. 
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Chapter 11 

Chapters 6, 7, and 8 of Dickson (1952b) are devoted to the prob¬ 
lem of representing numbers as sums of two, three, and four squares. 
Fermat’s Conjecture is treated by Dickson (1952b), Chapter 26. 


Chapters 12, 13, and 14 

Dickson (1952b), Chapter 3, gives the history of partitions prior to 
1920. An account of Ramanujan’s contributions is given by Hardy 
(1940), Chapter 6. Alder (1969) and Andrews (1970) present up-to- 
date histories of partition identities. 

The proof of Jacobi’s Triple Product Identity (Theorem 13-8) was 
independently discovered by Andrews (1965) and Menon (1965). 

The proof of the Rogers-Ramanujan identities is derived from 
Andrews (1966). 

The exposition of Schur’s theorem is an amalgamation of Andrews 
(1967) and (1968). 


Chapter 15 

Gauss’s Circle Problem is described by Dickson (1952b), 
Chapter 6. 

Dickson (1952a), Chapter 10, treats Dirichlet’s Divisor Problem. 


Appendix A 

The proof that lim p(n) lln = 1 is taken from Andrews (1971). 


Appendices B and C 

Further developments of the theory of infinite series and products 
may be found in Bromwich (1959). 




SUGGESTED READING 


For further elementary topics in number theory the reader could 
do no better than to study the text by Hardy and Wright (1960). 
Advanced topics related to Parts I and III may be found in the books 
by Ayoub (1963), Grosswald (1966), and LeVeque (1956). Vorlesungen 
iiber Zahlentheorie, a classic treatise by Landau (1947), covers the 
theorems of our Chapter 15 in greater depth. For material on algebraic 
number theory, we refer the reader to the elementary text of Pollard 
(1950) and to the very advanced book of Weiss (1963). Finally, for 
further material on partitions, we suggest the work by Knopp (1970) 
as well as the survey articles by Alder (1969) and Andrews (1970). 
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HINTS AND ANSWERS 
TO SELECTED EXERCISES 


SECTION 1-1 

1. Since l 2 = 1 = 1 • (1 + 1) • (2 • 1 + l)/6, the formula is true 
for n = 1. Assuming that the formula is true for the integers 1,2 ,... ,k, 
we see that 

l 2 + 2 2 + . . . + (fc + l) 2 = (l 2 + 2 2 + . . . + k 2 ) + (fc + 1) 2 

= k(k + l)(2k + l)/6+ (k + l) 2 
= (fc + l)((fc + l) + l)(2(Jk + l) + l)/6. 

Thus, whenever the formula is true for 1,2,.. .,fc, it is also true for 
k + 1. 

2. Hint: Prove 


l 3 + 2 3 + . . . + n 3 = |n 2 (n + l) 2 . 

7. Since 1 = F t = 2 — 1 = F 3 — 1, the formula is true for n = 1. 
As suming that the formula is true for the integers 1,2,..., k we see that 

F, + F 2 + . . . + F k+1 = (F. + F 2 + . . . + F k ) + F k+1 

— (F fc+ 2 — 1) + F fc+ i = ( F k+2 + F k+1 ) — 1 

= F* + 3 1. 

10. Since F 2 2 — FjF 3 =1 — 1 • 2 = — 1 = (—1) *, the formula is true 
for n = 1. Assuming that the formula is true for the integers 1,2,... ,k, 
we see that 

F k+2 — F k+1 F k+3 = F k+2 (F k+1 + F k ) — F k+J (F k+2 + F k+1 ) 

= F k+2 F k F| +1 = (F k+ 1 F fc F k + 2 ) 

= -(-l) k = (-l) fc+1 . 
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17. Since 1 • (1 — 1) (3 ■ 1 + 2) = 0 = 24 0, the statement is 
true for n = 1. Assuming that the formula is true for the integers 
1,2,... ,/c, we see that 

(k+l)((A+l) 2 -l)(3(/c-fl)-f2) = (k+l)(k 2 4-2k)(3k + 5) 

= fc(fc 2 -l)(3fc-f 2) + 12k(k+l) 2 . 

Now by our assumption, 24 divides k (k 2 — 1) (3k + 2). Does 24 
divide 12A: (k + 1) 2 ? If k is even, then k — 2r and 12fc(fc+l) 2 = 
24r(2r+l) 2 ; if k is odd, then A: = 2^ -h 1 and 12& (k + 1) 2 = 
48 (2s + 1) (s + l) 2 . Hence 24 does divide 12A: (A: + l) 2 , and the state¬ 
ment is proved by mathematical induction. 


SECTION 1-2 

1. 100, 112, 211. 

3. 6. 

6. Since we are representing a positive integer n, we know that 
n > 0. Since in any representation a t ^ k — 1, we see that 

n = a s k s + a s - l k s ~ 1 + . . . + a 0 — (k — 1) (k s + k s ~ x + . . . + 1) 

= (k — 1) (k s+1 — l)/(fc — 1) = k s+1 — 1. 


SECTION 2-1 

1. Take q = — | j | — 1. 

6. Since a and b are odd integers, a = 2r + 1, and b = 2s + 1. Thus 

a 2 — b 2 = (4 r 2 + 4r + 1) — (4s 2 + 4$ + l) = 4(r — s)(r — s + 1). 

Now if r — s is even, then r — s = 2m and a 2 — b 2 = 8m (2m + 1); if 
r ~ s is odd, then r — s = 2 n + 1 and a 2 — b 2 = 8(2 n + 1) (n + 1). Thus 
in any case a 2 — b 2 is divisible by 8 if a and b are odd integers. 


SECTION 2-2 

1. (a) 17; (c) 333; (d) 27. 

4. (a) 150; (c) 81; (e) n(n + l); (f) (2n - 1) (2n + 1). 
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8. If d = g.c.d.(a,b), then d\a and d\b. Thus d\(a + b) and 
d | (a — b ). Hence d | g.c.d. (a + b, a — b), so d ^ g.c.d. (a + b,a — b). 

SECTION 2-3 

1. (a) y = 4 + 2t, x = -4-3t; (c) no solutions; (e) y = 21+5t, 
x = 21 +4#. 

2. 5 apples and 4 pears. 

4. Hint: As a special case, compute the area of the triangle with 
vertices (0,0 ),(b,a), and (b, 0), and then subtract both the area of the 
trapezoid with vertices (x,0),(fe,0),(b,fl),(x,y) and the area of the 
triangle with vertices (0,0),(x,y),(x,0). 



Then consider the other possible ways the triangle (0,0), ( b,a ), 
(x,y) could be positioned in the plane. 

7. l/(o 2 + b 2 ) m . 

8. (a 2 + b 2 ) 1,2 /g.c.d.(a,b). 

SECTION 2-4 

I. 60 = 2 • 30 = 6 • 10. 

3. Hint: In the factorization of a, collect all repetitions of each 
prime factor as that prime raised to a power. For example, 36 = 
2 • 2 • 3 • 3 = 2 2 • 3 2 . 

6. (a) 11; (c) 17; (f) 1. 

9. (a) 750; (e) 1221; (f) p 2 qr. 

II. g.c.d. (39,102,75) =3; l.c.m.(39,102,75) = 33150. 

12. Hint: use Exercises 5, 7, and 10. 
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SECTION 3-1 

2. The total number of subsets of a set of n elements is 2 W by Exer¬ 
cise 1. On the other hand, there is 1 subset with no elements; there 

are ^ ^ subsets with one element, (^j with two elements, and so on. 

This alternative way of counting the subsets of S establishes the 
formula. 

5. Since ^ +( r ^^) = l + r+ l = r + 2=^^^, the formula 

is true for m= 1. Assuming that the formula is true when m is any of 
the integers 1,2,... ,k, we see that 

CM r : i )+-+r* +i ) 

=c:tn+r‘ +i H:i: 2 ) o**™*.*. 

7. Letk» = l + (") + (")+(g) + . . 

and 

HTMSMSH?)*-- 

Then 

h n + k n — 1 + ^ ^ + . . . + = 2 n (by Exercise 2), 

and 

1 - (?) + (?)-• • • + (": l ) = ° 

(by Exercise 6). Now we have two equations for h n and k n . Solving 
these equations, we find that h n = k n = 2 W_1 . 

9. Since (^j = ^ * a \^ — ~~~ * s an i nte g er > we see that 

a\ | p(p~ 1)... (p — a~\- 1). But since a <p, we must have g.c.d.(a!,p) = 
1; therefore, by Theorem 2-3, a! \ (p — 1) . . . (p — a + 1). Hence 
/p\_ (p-1) . . . (p-a + 1) 

\a)~ P ■ a! 
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12 . 



(-l)X 



(- 1 )* 



b r 


2 2 



(-1 ) s b r 


r ?o for s ?r s\{n~s) \ r\(s — r )! ( 1)8 


n 

= V 

n * h V - 

(n — r)! (— 1 )® 

Zu 

r~ 0 

r! (n — r )! r 

(n — s) ! ($ — r)! 

II 

||V|s 

53 

1 1 
** ■» 

) (~d s 

II 

(?ks:(v; 

| (~D s+r . 


Now if n — r > 1 , then by Exercise 10 (with x = 1 , y = — 1 ), 

0= (l-l)»- r = 'g ( n J r ) (-1)« 


Consequendy, the last expression in our string of equations has only 
one nonzero term, namely that in which n = r, and that term is 

(-1 ) n b n . 

13. Clearly the assertion is true for n = 1 and 2. Suppose that it is 
true for the integers 1 , 2 .,k; then 


^m+m+rn-- 

= 1 + {(VMV)H(VMV)] 

+ {(VHV)h- 

-mvmvmv)*-} 

+ f 1 + { k 1 3 ) + { k 2 4 ) + ‘ • •} = Fk + Fk ~ 1 = Fk+1 ■ 
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SECTION 3-2 

1. By Fermat’s little theorem, p\n(n p ~ l — 1). Therefore, by 
Corollary 2-3, p | n p_1 — 1. 

3. Hint: prove that n 5 = n (mod 5) and n 5 = n (mod 2). 

5. Hint: use Exercise 1. 


SECTION 3-3 

1. Any prime smaller than p divides (p —1)! and so does not 
divide (p —1)1+1. On the other hand, by Wilsons theorem, 
P I ((p-l)! + l). 

2. The expression (n — 1)! + 1 is equal to 2 for n = 1 and 2, to 3 
for n = 3, to 7 for n = 4, and to 25 for n = 5. For n > 5, (n — 1) I is 
divisible by 10 and so (n — 1 )! 4 -1 is not. 


SECTION 3-4 

i- jr^xy =x fx ( 1 - x) ~ 1 =x fx il + x+x2+x3 + --- ) 

= x(l + 2x + 3x 2 + 4x 3 + . . .) 

= x + 2x 2 + 3x 3 + 4x 4 + . . .. 


= | (2 + 3 • 2x + 4 • 3x 2 + 5 • 4x 3 + . . .) 


= ^ ( 2 ) x + ( 2 ) xl ( 2 ) x * 


+ . . . . 


6 . (l- 2 x )“ 1 = l+ 2 x+ (2x) 2 + (2x) 3 + . . . 

= l + 2x + 4x 2 + 8x 3 + . . .. ' 


7. Hint: decompose the expression 
tial fractions. 


(1 — bx) (1 — bx — x) par 
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SECTION 4-1 

1 . (a) 2; (b) 3; (c) 4. (There are other correct answers.) 

3 . ( a 0 x r + a 1 x r ~ 1 + . . . + a r ) — ( a 0 y r + a 1 y r ~ 1 + . . . + a r ) = 
a 0 (x r — y r ) + a^x^ 1 — y r ~ 1 ) + . . . + ar-iix — y). Hence we need 
only show that m | (x r — y r ) for each positive integer r. This is clear 
since rn \ (x — y), andx r — y r = (x — y) (x r ~ 1 + x r ~ 2 y +. . . + xy r ~ 2 + y r_1 ). 

4. This is just a special case of the cancellation law. 

7. (b) yes; (c) no; (e) yes. 

SECTION 4-2 

1. All are complete residue systems modulo 11. 

2 . (a) and (d) are reduced residue systems modulo 18. 

4. Yes, every number counted by w(n) is also counted by 
c p(n), and since g.c.d.(l,n) = 1 , we see that there is always a non¬ 
prime integer (namely 1 ) smaller than n that is relatively prime to n. 

w(n ) = <f>(n) - 1 for n = 1,2,3,4,6,8,12,18,24,30. 


SECTION 4-3 

1. 3, 6 , 12. 

SECTION 5-1 

1 . (a) 7; (b) 5, 20; (c) 3, 8 , 13. 
3. (a) 6 ; (b) 4; (c) 10. 

SECTION 5-2 


1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

27 

15 

3 

30 

18 

6 

33 

21 

9 

36 

24 

12 
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3. {a — 1) (1 + a + . . . 4- a Mm) x ) = —1; nowm | —1 and 

g.c.d.(a ~ 1 ,ra) == 1. Hence m | (1 + a + . . . + a < ^ (m) " 1 ). 

5. 41 75 s (-1 ) 75 =— 1=2 (mod 3). 

7. Hint: 10 n = l n s 1 (mod 9). 

8 . If x = a p ~ 2 b , then ax = a p ~ 1 b = 1 • b = b (mod p). 

9. Since p = 1 (mod 4), we may write p = 4t+l. Hence 
(4 1 )! = —1 (mod p), and 

(40!= (2t)! (2t + 1) (2t 4- 2) . . . (2f+ 2f) 

= (2t)\(p—2t)(p — ( 2 f — 1 )) . . . (p- 1 ) 

= ( 2 t)! (—2t) (—(2t — 1 )) . . . (- 1 ) (mod p) 

- ( 2 t)!(-l) 2 '( 2 f)l ^ ( 2 t) ! 2 (mod p), 

and 2 f —|(p — 1 ). 

10. (a) 720; (b) 40320. 

14. There are no primes in this interval since m \ + j is divisible 
by j for 2 ^ j ^ m. 

17. 161 + 1 = 20922789888001 = 17 • 1230752346353. This is 
certainly a candidate for the most inefficient possible test for de¬ 
termining whether 1093 is a prime. 

20 . (a) <D n (O n - 2 ) = ( 2 2 ” + 1 )( 2 2 ”— 1 ) = ( 2 2 ”) 2 -l = 2 2 n+ 1 -l. 

(b) If b = ca , then 

2 * — l = ( 2 «) c — 1 = ( 1 + 2 a +. . .+ 2 a(c “ 1 ) )( 2 a -l). 

(c) Since n + 1 ^ 2 n (by Corollary 1-1), 2 n+1 1 2 2 ”; hence, 
by (b), 

( 2 2 " + 1 -l) | ( 2 22 ”-l). 

(d) (2 22 ” +1 -2) =2(2 2Z "-1). 

(e) <J>„| ( 2 2 ” + 1 -l) | ( 2 22 "-l) | ( 2 22 " + 1 - 2 ) = 2 4 > n- 2 . 

22. By Exercise 21, 2* = ±1 (mod p), and (! = ±1 (mod p) by 
Exercise 9 or 15. Hence 


-1 - (21)1-2*1!-1-3-5... (21-1) (mod p) 
= ±1 -3 -5 .. . (21 — 1) (mod p). 
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SECTION 5-3 

1. (a) All integers congruent to 23 modulo 30. 

4. Hint: in Theorem 5-4, choose m x as the square of the ith prime. 
Then consider the congruences 

x = 0 (mod 2 2 ) 
x + 1 = 0 (mod 3 2 ) 


x + n = 0 (mod p n 2 ). 

6. 29. 


SECTION 5-4 

1. (a) No solutions; (b) all integers congruent to 7 modulo 13; 
(c) all integers congruent to 4 modulo 7. 

7. Both sides of the congruence are congruent to zero for 
x= 1,2,. . . ,p — 1. Since (x p_1 — 1) — {x — 1) (x — 2) . . . (x — p + 1) is 
a polynomial of degree less than p — 1, it must be congruent to zero 
modulo p for each integer x. 

8. Wilson’s theorem. 


SECTION 6-1 

1. If m = p j q k . . . r l is the prime factorization of m and j ^ 2, 
then by (6-1-2), p | p(p j_1 — p j ~ 2 ) = p j — p i_1 = </>(p j ) I <M m )* Hence 
p | (f>(m) and p~|~m — 1. Therefore <£(m)~1"m — 1. 

2. If m has only two distinct prime factors and <f>(m) \ m — 1, then 
by Exercise 1, m = pq where p and q are distinct primes. Thus 

m — 1 _ pq — 1 _ pq — p — q + l + p + q—2 

0(m) (p — l)(q — 1) (p-l)(q-l) 

= (p-l)(q-l) p~ 1 . q-l 

(p-l)(q-l) (p — l)(q — 1) (p-l)(q-l) 
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Since we assume that (j)(m) \ m — 1, then (m — l)/(f>(m) must be an 
integer; however, 1 < 1 + ^ 1 + — ^ 3 if p and q are primes. 


Thus we must have 


p — 1 


■ + - 


V -1 Q-l 


= 1 or 2, 


and this is possible only for p = q = 2 or p = q = 3. Finally, we ob¬ 
serve that each of these solutions is inadmissible because p and q are 
distinct primes. 


5.<m») =» n ( i- i)- n n ( i_ |) =n ii 

pin' ** f Pin ' 1 ' Pin 


2 -" 2 " 


7. Yes. If Goldbach’s conjecture is true, then for each n there 
exist primes p and q such that 

2n + 2 = p + q = <t>(p) +1 + <f>(q) +1. 


Hence 


2 n= 0(p ) + cj>(q). 

8. Hint: if nx 0 + my 0 = nxj + my, (mod mn), where g.c.d.(x 0 ,m) = 
g.c.d.(x 1? m) = g.c.d.(y 0 ,n) = g.c.d.(y,,n) = 1, then 


n(x 0 — xj = m{y 1 — y 0 ) (mod mn). 


By Theorem 5-1, this is possible only if n\m(y 1 — y 0 ), but 
g.c.d.(n,m) = 1; hence «| (t/i — y 0 )- A similar argument shows that 
m| (x 0 — x t ). 


12. (a) 4>(4s + 2) = 0(2(2s + 1)) = (f)(2) 0(2s + 1) = 0(2s + 1). 

(b) *(1) = 0(2) = 1, 0(3) = 0(4) = 0(6) =2, 0(5) =0(10) = 
0(8) = 0(12) = 4, 0(7) = 0(9) = 0(14) = 0(18) = 6, 
0(11) = 0(22) = 10, 0(13) = 0(21) =12, 0(15) = 0(16) = 
0(20) =8, 0(17) = 0(32) = 16, 0(19) = 0(38) = 18. 

13. 0(11”) = 11" - ll*" 1 = IP -1 ■ 10. 

14. 0(2 2n+1 ) = 2 2n+1 — 2 2n = 2 2n (2 — 1) = 2 2 " = (2 M ) 2 . 
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SECTION 6-2 

1. We say that x and y are complementary divisors of n if xy = n. 
Thus we may separate the divisors of n into pairs of complementary 
divisors. This means that there will be an even number of divisors 
of n except when one (or any odd number) of the pairs of comple¬ 
mentary divisors does not contain two distinct integers. This occurs 
only when x = y or x 2 = n. Hence if n is a perfect square, then n has 
an odd number of divisors. Otherwise n has an even number of 
divisors. 

4. Hint: if x + (x + 1) + . . . + (x + r) = n, then 
(r+l)x + ^r(r+l) — n. 

7. Hint: since 1 + p + . . . + p n = ^ _ j - ,1+2p + 3 p 2 + . . . 

d d + l — 1 

+ „ p ,-. , _ , 1 + „ + . . . + p .) , _ 

9. 2 n = <r(n) = 2 ^ = 2 7 =n 2 hence 2 n. 

d\n d\n a d\n a d\n U 

12. If n = p, a prime, then 

(fr(n)q-(n) + 1 _ (p — 1) (p + 1) + l _ p 2 — 1 + 1 _ 
n p p 

If n — p j m , where j ^ 2 and g.c.d. (p,m) = 1, then 

4>(n) cr(n) + 1 _ <ft(p J ) 4>(m) cr(n) + 1 _ p (p 5 ~ x — p j ~ 2 ) cr(n) + 1 

n p j m p j m 

Hence p is not a factor of the numerator, but it is a factor of the 
denominator. 


SECTION 6-3 

1. Since/(n) = n k is obviously multiplicative, ^ f(d) = ^ d k = 

dI n d\n 

o>(n) is also multiplicative by Theorem 6-7. 
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2. Since both/(n) and p.{n) are multiplicative, so is f(n) n(n), 
and thus by Theorem 6-7, so is 

F(n)=2 p(d)f(d). 

d I n 

Thus if n = p"'p 2 2 • ■ ■ P? r , then 

F(n) = F(p 1 a, )F(p 2 “ 2 ) . . . F(p“ r ) 

= (1 -/(P,))(l -fit*)) • • • (1 -f(Pr)) 

=n a -f(p))- 

P I n 


7. Since fi(n) and <f>(n) are multiplicative and <f>(n) 0, /x 2 (n)/<£(n) 

is multiplicative and thus ^ fi 2 (d) / (d) = G(n) is also multiplica- 

d I n 

tive by Theorem 6-7. Hence, if n = p “ 1 pi 12 . . . pf r , then 
G(n) = G(p 1 ai )G(p 2 a: ‘) . . . G{p r a r) 


1+ . 1 \ (i+ 1 . .) = fit Pj— 

\ 4>(Pr )/ Pl-1 Pr~ 1 

11 n 


J_ • 

" l-±~ 

n( 



Pi 

Pr 

Pi) 

\ Pr) 


n 

0(n) ' 


13. ^ p(n)/(x/n)=^ M”) X g(*/ mn ) 

w=l n= 1 m= 1 


= 2 p(n)g(x/mn) g(x/k) p(n) 

mn^x k = 1 mn = k 

= g(x). 


14. Let a* be the ith divisor of n. Then 


ax + fl 2 + ■ • ♦ + #d(n) > 

d(n) 




SECTION 7-1 

1. (a) If r = ind^a and s = ind g b, then 

g r = a = b = g s (mod m). 
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Hence, since g belongs to the exponent </>(m) modulo m, we see that 
| r — s; that is, ind^a = ind g b (mod cf>(m)). 

2. We only need a table for the positive integers less than 17: 


a 

i 

2 1 

3 

4 1 

5 

6 

7 

8 

9 

10 

11 

12 

13 

14 

15 

16 

ind 3 a 

16 

14 

1 

12 

5 

15 

11 

10 

2 

3 

7 

13 

4 

9 

6 

8 


3. ind 3 9 + ind 3 x = ind 3 ll (mod 16) 

2 + ind 3 x = 7 (mod 16) 
ind 3 x s 5 (mod 16). 

Hence x must be congruent to 5 modulo 17. 

6. 2,3 (mod 5); 2,5 (mod 9); 2,6,7,8 (mod 11); 2,6,7,11 (mod 13); 
none (mod 15). 


SECTION 7-2 

1. Since j = h * (fc/g.c.d.(h,fc)) = k • (h/g.c.d.(h,fc)), we see that 
both h and k divide j. Hence 

a j = 1 (mod m), and a j = 1 (mod n); 

therefore, m | a j — 1 and n | a j — 1. However, g.c.d. (m,n) = 1, and so 
mn | a j — 1, or a 5 = 1 (mod mn). 

3. If n ^ 3, and g.c.d.(a,2 n ) = 1, then by Exercise 2, 

a 2 ” -2 = 1 (mod 2 n ), 

but <^(2 W ) =2 n_1 > 2 W_2 . Thus, no integer belongs to <£>(2 n ) modulo 
2 n , and so there are no primitive roots modulo 2 n . 

7. (g + p) p ~ x — g p ~ 1 4- (j* j ^ g p ~ 2 P + terms involving p 2 

= g p_1 + (p — 1 )g p ~ 2 p (mod p 2 ) 

= 1 — g p ~ 2 p (mod p 2 ). 

If this last expression were congruent to 1 modulo p 2 , it would imply 
that p | g, an impossibility. 
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9. Hint: 4>(p m ) = p m - p m_1 . 

10. By assumption, the assertion is true for m — 2. Assume that it 
is true for the integers 2,3,... ,k. Now 


g Mpk i) = gi> k 2 (p i) == l (mod p k x ). 

Hence since the assertion is assumed true for Zc, we must have 
gP k 2 (p- i) = i rp k '~ 1 , where p^r. Therefore, 

gP k ~ 1 (p- 1) = (gi) k ~ 2 (p-l)^p — _j_ r pk-l^p 

= 1 4- ( j) rp k ~ 1 4- terms involving p k+1 
= 14- rp k (mod p k+1 ). 

Hence, 

gP k ~Hp-i) ^ x ( mo d p ^+i) ? 


since p~\~r. 

15. One modulo 6, two modulo 7, none modulo 8, two modulo 9, 
two modulo 10. 


SECTION 8-1 

1. See Exercise 20 of Section 5-2. 

2. If n > ra, d | 3> m , and d | <J> n , then by Exercise 1, d 12. Hence d 
is either 2 or 1 , and since is odd, d= 1 . 

3. Let p n be a prime factor of Then p x ,p 2 ,Pz, • • • must be an 
infinite sequence of distinct primes since no has any prime factor 
in common with any other O m . 

4. Hint: Assume that there are exactly r primes . . . ,q r 

that are congruent to 5 modulo 6. Examine the prime factors of 
6q x q 2 . . . q r — 1. 

8. a 2 = 3, a 3 = 5, a 4 — 7, a 5 = 11, a 6 = 13, a n is the nth prime. 

9. 2 = p x < 2 21 4-1 = 5. Assume the assertion for 1,2,... ,k. Then 




HINTS AND ANSWERS 


247 


p k+1 ^PiP 2 ...p*+l *2* , -2**.. .2 2fc +l=2 2+22+ - +2fc + l 

= 2 2fc+l-2 _|_ 1 < 2 2 fc+1 + 1. 

12. e(x) = ^ log P S 2 log x = log x 2 1 = ^(*) l°g x - 

P^X P = X P^X 

13. = (2-1)(3-1)(5-1) ... (p-1) >1. Thus there 
exist integers not exceeding M having no prime factors among 

2,3,5 

14. * = 41. 

15. 20! = 2 18 3 8 5 4 7 2 11 • 13 • 17 • 19. 


SECTION 8-2 


1. »<125x) - „ (*) > ^ kUb - 30 108 2 T 

1 o ( 3125 _ 30 \ 

x (log 125x log x/ ’ 


and if x is sufficiently large, lo glfe ~ log x+log 125 > 3T^5' 
Consequently, for x sufficiently large, 7 t( 125 x) — tt{x) > 0. 

2 4 

2. Since - n < p ^ n, we see that - n < 2p s 2n, and 2n < 3p. 

O O 

Hence, there is exactly one term among (n + 1), . . . ,2n that is divisi¬ 
ble by p (namely2 p), and there is exactly one term among 1,2,. . .,n 

/2n\ 

that is divisible by p (namely p itself). Hence ( ) =2n(2n — 1) . . . 

(n + l)/n! has p appearing once in the denominator and once in the 
numerator. Therefore, since these factors cancel, we see that p is not a 

factor of 



SECTION 9-1 

1. (a) 2 2 = 4 = — 1 (mod 5); therefore, 2 is not a quadratic residue 
modulo 5. 

(b) 4 3 = 64 = 1 (mod 7); therefore, 4 is a quadratic residue 
modulo 7. 
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(c) 3 5 — 243 = 1 (mod 11); therefore, 3 is a quadratic residue 
modulo 11. 

(d) 6 6 = 46656 =—l (mod 13); therefore, 6 is not a quadratic 
residue modulo 13. 


SECTION 9-2 

1. Suppose c = p t p 2 . . . p r ; then 

/ ah\ _ fab\ (ab\ / ab\ 

\c ) \Pi)\P 2 / m ' m \Pr) 

_/ a\(b\(a\(b\ la\ / b \ 

\PJ \Pl) \P2/ \Pi) ' * * \Pr) \Pr) 

= / a \ l a \ / a \ / b \ / b\ / b \ _ / a\ / b\ 

\Pl) \P2/ ’ ’ ‘ \Pr) \Pl) \P 2 / * ’ ‘ \Pr) ~ \c) \c) * 


SECTION 9-3 

1. If c = pjp 2 . . . p r , then 

_ f 1 if an even number of the p t are congruent to 3 (mod 4) 
1 if an odd number of the p, are congruent to 3 (mod 4) 

_ f 1 if c = 1 (mod 4) 

1—1 if c = —I (mod 4) 

= (-l)2 (c_1) * 

4. The least residues modulo 19 of 17,34,51,68,85,102,119,136, 
and 153 are —2, —4, —6, —8, 9, 7, 5, 3, and 1 respectively. Hence, 

— (—1) M = (—l) 4 = 1. Therefore, 17 is a quadratic residue 

modulo 19. 
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(§)-m 


SECTION 9-4 

L (S) - (I?) - (if) - (tf) (A) - (t?) “ (f) - (!) - 

Hence, x 2 = 17 (mod 29) has no solutions. 

2. This problem is equivalent to x 2 = 4 (mod 23), and x = 2 is an 
obvious solution. 

4. Hint: (x + a) 2 + b = x 2 + 5x — 12 (mod 31) holds provided 
that 2a = 5 (mod 31), and a 2 + b = —12 (mod 31). Hence, we may 
take a = 18 and b = 5. To finish we need only solve the congruence 
y 2 m -5 (mod 31). 


5. If p = 12k + 1, then p = 1 (mod 4), and ^j = 
0-^ = 1. The other cases are handled similarly. 


SECTION 10-1 




5. The number of solutions is 
, V /n 2 + 3n + 2\ , V ( 

p + 2 (-p-) = P + 2 V 

n = 0 ' " / n = 0 a 


(n + 1) (n + 2)\ _ 


P 


) = p - 1 . 


6. Hint: 


, f 1 if d is a square-free quadratic 

2 |l + jj | /x(d) | = | residue modulo p 


[0 otherwise. 


SECTION 10-2 

2. The number of solutions is 


'W /n 3 + 3n 2 + 2n 


p-i / 

p + 2 ( 

W s= 0 ' 


p 


) ■ ” + 2 ( 


^ 1 /n(n + 1) (n + 2) N 


P 


IS?:? 


if p = 3 (mod 4) 
(mod 4). 
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SECTION 11-1 

1. 274625 = 5 3 13 3 = 5 2 5 • 13 2 13 = 65 2 (2 2 + l 2 ) (3 2 + 2 2 ) 

= 65 2 (8 2 + l 2 ) = 520 2 + 65 2 . 

2. 333 = 9 • 37 — 3 2 (6 2 + l 2 ) = 18 2 + 3 2 . 

SECTION 11-2 

9. Suppose that 4 a (8m 4- 7) = x 2 + y 2 + z 2 where a > 0; then 
x, y, and z - must all be even. Otherwise, x 2 H- y 2 4 z 2 ^ 0 (mod 4). 
Hence, x = 2x 0 , y = 2t/ 0 , and z = 2 z 0 ; therefore, 

4«(8m + 7) = 4 (x 0 2 + y 0 2 + z 0 2 ), 
or 

4 a ~ 1 (8m + 7) = x 0 2 + t/ 0 2 + z§. 

We may continue this process until we reduce the exponent of 4 to 
zero. Consequently, 4 a (8m + 7) is a sum of three squares only if 
8m + 7 is. If 8 m + 7 = x 2 4 y 2 H- z 2 , then 

x 2 4- y 2 4- z 2 = 7 (mod 8). 

However, since a perfect square is congruent to one of 0, 1, or 4 
modulo 8, we see that 

x 2 4- y 2 4- z 2 = 0,1,2,3,4, 5, or 6 (mod 8). 

Thus 8m 4 7 is not equal to the sum of three squares. 


SECTION 12-2 

1. (a) ... . 


6 

1+141+1+1+1 

5 + 1 

2+1+1+1+1 

4 + 2 

2+2+1+1 

4 + 1 + 1 

3 + 1 + 1 + 1 

3 + 3 

2 + 2 + 2 

3 + 2 + 1 

3 + 2 + 1 

3 + 1 + 1 + 1 

4 + 1 + 1 

2 + 2 + 2 

3 + 3 

2+2+1+1 

4 + 2 
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SECTION 12-3 


9 

9 

7 + 1 + 1 

7 + 2 

5 + 3 + 1 

5 + 3 + 1 

5+1+1+1+1 

5 + 4 

3 + 3 + 3 

6 + 3 

3 + 3 + 1 + 1 + 1 

6 + 2 + 1 

3 + 1 + 1 + 1 + 1 + 14*1 

4 + 3 + 2 

l+l+l+l+l+l+l+l+l 

8 + 1 


SECTION 12-4 

10. Hint: the answer is not the set of positive integers congruent 
to 1, 2, or 3 modulo 6. 


SECTION 13-1 

1. Hint: Separate the partitions into two classes: (1) those that 
have m as a summand, and (2) those that do not. 

4. Hint: See the preceding hint. 

7. f[ (1 + < 7 7n+1 )(1 + q 7n+2 )(l + q 7n+4 ) 

n = 0 

“ (1 - q“”+2)(l - a 14«+4) (1 _ q,14« + 8) 

JJo (1 - <7 7 " +1 )(l - q 7n+2 )( 1 - <7 7n+4 ) 

_ » _ (1 - q 14n+2 )(l ~ q 14n+4 )(l ~ q 14re+8 ) _ 

“JLL (l-q^ + i)(l-q^+»)(l-q 14n+2 )(l-q 1 * n+9 )(l-q 14n+4 )(l-q Un+n ) 

= n (1 - q 14n+1 ) (1 - <7 14 "+ 9 )(1 - q 14 "+x') • 


SECTION 13-2 

4. Hint: in the result of Exercise 3, first replace z by bz, and then 
replace a by alb. 

6 . 

* qH n+1> (1 —a)(l —aq) ■ . ■ (1-aq”- 1 ) 

„4 (1 — </)(1 — <7 2 ) • • • (I - <7 n ) 
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q 2 n(n+ 1) 




J] (1 -aq m ) 

m = 0 _ 

00 

J] (l-aq m+n ) 


a r q rn 


J3, (1 ^ S‘ 0 (l-q)(l-q 2 )...(l-q n ) £ 0 (l-qXl-q*)...(l-q r ) 
7. Continuing from the preceding answer: 


i— r . x _ ti - n T n(n+l)+rn 

-jja-ao 2 ^ )(1 _ < , i ,,., (1 _ rt s, a -, )( 'iL < ,. ) .,, 1 - q , ) 

00 00 r 00 

ft< 1 + «.>• 


SECTION 14-2 


1. Setting z-1 in Theorem 13-8, we see that 
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Multiple, least common, 22 
Multiplicative functions, see Arith¬ 
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metic functions. 
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computational, 44-48 
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